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TRANSGENIC MICE CONTAINING TARGETED GENE DISRUPTIONS 

Field of the Invention 

The present invention relates to transgenic animals, compositions and methods relating to the 
characterization of gene function. 
5 Background of the Invention 

Many medically significant biological processes are mediated by proteins participating in 
signal transduction pathways that involve G-proteins and/or second messengers such as cAMP. The 
membrane protein gene superfamily of G-protein coupled receptors (GPCRs) include a wide range of 
biologically active receptors, such as hormone, viral, growth factor and neuroreceptors. GPCRs have 

10 been characterized as having seven putative transmembrane domains (designated TM1, TM2, TM3, 
TM4, TM5, TM6, and TM7), which are believed to represent transmembrane a-helices connected by 
extracellular or cytoplasmic loops. Most G-protein coupled receptors have single conserved cysteine 
residues in each of the first two extracellular loops which form disulfide bonds that are believed to 
stabilize functional protein structure. G-protein coupled receptors can be intracellularly coupled by 

15 heterotrimeric G-proteins to various intracellular enzymes, ion channels and transporters. Different 
G-protein ct-subunits preferentially stimulate particular effectors to modulate various biological 
functions in a cell. 

Over the past 15 years, nearly 350 therapeutic agents targeting 7 transmembrane receptors 
have been successfully introduced onto the market. As these receptors have an established, proven 
20 history as therapeutic targets, a clear need exists for identification and characterization of GPCRs 
which can play a role in preventing, ameliorating or correcting dysfunctions or diseases. 

Given the importance of GPCRs, a clear need exists for identification and characterization of 
GPCRs which can play a role in preventing, ameliorating or correcting dysfunctions or diseases. 

Summary of the Invention 
25 The present invention generally relates to transgenic animals, as well as to compositions and 

methods relating to the characterization of gene function. 

The present invention provides transgenic cells comprising a disruption in a targeted gene. 
The transgenic cells of the present invention are comprised of any cells capable of undergoing 
homologous recombination. Preferably, the cells of the present invention are stem cells and more 
30 preferably, embryonic stem (ES) cells, and most preferably, murine ES cells. According to one 

embodiment, the transgenic cells are produced by introducing a targeting construct into a stem cell to 
produce a homologous recombinant, resulting in a mutation of the targeted gene. In another embodi- 
ment, the transgenic cells are derived from the transgenic animals described below. The cells derived 
from the transgenic animals includes cells that are isolated or present in a tissue or organ, and any cell 
35 lines or any progeny thereof. 
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The present invention also provides a targeting construct and methods of producing the 
targeting construct that when introduced into stem cells produces a homologous recombinant. In one 
embodiment, the targeting construct of the present invention comprises first and second polynucle- 
otide sequences that are homologous to the targeted gene. The targeting construct also comprises a 
5 polynucleotide sequence that encodes a selectable marker that is preferably positioned between the 
two different homologous polynucleotide sequences in the construct. The targeting construct may 
also comprise other regulatory elements that may enhance homologous recombination. 

The present invention further provides non-human transgenic animals and methods of pro- 
ducing such non-human transgenic animals comprising a disruption in a GPCR gene. The transgenic 
10 animals of the present invention include transgenic animals that are heterozygous and homozygous for 
a mutation in the GPCR gene. In one aspect, the transgenic animals of the present invention are 
defective in the function of the GPCR gene. In another aspect, the transgenic animals of the present 
invention comprise a phenotype associated with having a mutation in a GPCR gene. 

The transgenic animals of the present invention include transgenic animals that are hetero- 
15 zygous and homozygous for a mutation in the melanocortin-3 receptor gene. In one aspect, the trans- 
genic animals of the present invention are defective in the function of the melanocortin-3 receptor 
gene. In another aspect, the transgenic animals of the present invention comprise a phenotype 
associated with having a mutation in a melanocortin-3 receptor gene. In a preferred embodiment, the 
non-human transgenic animals with a mutation in the melanocortin-3 receptor gene comprise 
20 abnormalities in the kidney. In another preferred embodiment, the melanocortin-3 receptor mutants 
demonstrate unilateral renal agenesis. In yet another preferred embodiment, the melanocortin-3 
receptor mutants demonstrate passive, hypoactive behavior. 

The transgenic animals of the present invention also include transgenic animals that are 
heterozygous and homozygous for a mutation in the 5-HT-2B gene. In one aspect, the transgenic 
25 animals of the present invention are defective in the function of the 5-HT-2B gene. In another aspect, 
the transgenic animals of the present invention comprise a phenotype associated with having a muta- 
tion in a 5-HT-2B gene. In a preferred embodiment, the non-human transgenic animals of the present 
invention comprise abnormalities in embryonic development In another preferred embodiment, the 
non-human transgenic animals of the present invention die between embryonic day 8.5 and 9.5. 
30 The present invention further includes transgenic animals that are heterozygous and homozy- 

gous for a mutation in the chemokine receptor 9 A gene. In one aJspect, the transgenic animals of the 
present invention are defective in the function of the chemokine receptor 9A gene. In another aspect, 
the transgenic animals of the present invention comprise a phenotype associated with having a 
mutation in a chemokine receptor 9A gene. In a preferred embodiment, the non-human transgenic 
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animals of the present invention demonstrate a decrease in agility, coordination or balance as 
compared to normal animals. 

The present invention also includes transgenic animals that are heterozygous and homozygous 
for a mutation in the glucocorticoid-induced receptor gene. In one aspect, the transgenic animals of 
5 the present invention are defective in the function of the glucocorticoid-induced receptor gene. In 
another aspect, the transgenic animals of the present invention comprise a phenotype associated with 
having a mutation in a glucocorticoid-induced receptor gene. In another preferred embodiment, the 
non-human transgenic animals of the present invention demonstrate hyperactivity and decreased anxi- 
ety. In yet another preferred embodiment, the non-human transgenic animals of the present invention 

10 demonstrate decreased propensity towards behavioral despair or depression. 

The present invention also provides methods of identifying agents capable of affecting a 
phenotype of a transgenic animal. For example, a putative agent is administered to the transgenic 
animal and a response of the transgenic animal to the putative agent is measured and compared to the 
response of a "normal" or wild type mouse, or alternatively compared to a transgenic animal control 

15 (without agent administration). The invention further provides agents identified according to such 
methods. The present invention also provides methods of identifying agents useful as therapeutic 
agents for treating conditions associated with a disruption of the GPCR gene. 

The present invention further provides a method of identifying agents having an effect on 
GPCR expression or function. The method includes administering an effective amount of the agent to 

20 a transgenic animal, preferably a mouse. The method includes measuring a response of the transgenic 
a nim a l , for example, to the agent, and comparing the response of the transgenic animal to a control 
animal, which may be, for example, a wild-type animal or alternatively, a transgenic animal control. 
Compounds that may have an effect on GPCR expression or function may also be screened against 
cells in cell-based assays, for example, to identify such compounds. 

25 The invention also provides cell lines comprising nucleic acid sequences of a GPCR gene. 

Such cell lines may be capable of expressing such sequences by virtue of operable linkage to a 
promoter functional in the cell line. Preferably, expression of the GPCR gene sequence is under the 
control of an inducible promoter. Also provided are methods of identifying agents that interact with 
the GPCR gene, comprising the steps of contacting the GPCR gene with an agent and detecting an 

30 agent/GPCR gene complex. Such complexes can be detected by, for example, measuring expression 
of an operably linked detectable marker. 

The invention further provides methods of treating diseases or conditions associated with a 
disruption in a GPCR gene, and more particularly, to a disruption in the expression or function of the 
GPCR gene. In a preferred embodiment, methods of the present invention involve treating diseases or 

35 conditions associated with a disruption in the GPCR gene's expression or function, including 
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administering to a subject in need, a therapeutic agent that effects GPCR expression or function. In 
accordance with this embodiment, the method comprises administration of a therapeutically effective 
amount of a natural, synthetic, semi-synthetic, or recombinant GPCR gene, GPCR gene products or 
fragments thereof as well as natural, synthetic, semi-synthetic or recombinant analogs. 
5 The present invention further provides methods of treating diseases or conditions associated 

with disrupted targeted gene expression or function, wherein the methods comprise detecting and 
replacing through gene therapy mutated GPCR genes. 
Definitions 

The term "gene" refers to (a) a gene containing at least one of the DNA sequences disclosed 

10 herein; (b) any DNA sequence that encodes the amino acid sequence encoded by the DNA sequences 
disclosed herein and/or; (c) any DNA sequence that hybridizes to the complement of the coding 
sequences disclosed herein. Preferably, the term includes coding as well as noncoding regions, and 
preferably includes all sequences necessary for normal gene expression including promoters, 
enhancers and other regulatory sequences. 

15 The terms "polynucleotide" and "nucleic acid molecule" are used interchangeably to refer to 

polymeric forms of nucleotides of any length. The polynucleotides may contain deoxyribonucleo- 
tides, ribonucleotides and/or their analogs. Nucleotides may have any three-dimensional structure, 
and may perform any function, known or unknown. The term "polynucleotide" includes single-, 
double-stranded and triple helical molecules. 

20 "Oligonucleotide" refers to polynucleotides of between 5 and about 100 nucleotides of single- 

or double-stranded DNA. Oligonucleotides are also known as oligomers or oligos and may be 
isolated from genes, or chemically synthesized by methods known in the art A "primer" refers to an 
oligonucleotide, usually single-stranded, that provides a 3-hydroxyl end for the initiation of enzyme- 
mediated nucleic acid synthesis. The following are non-limiting embodiments of polynucleotides: a 

25 gene or gene fragment, exons, introns, mRNA, tRNA, rRNA, ribozymes, cDNA, recombinant polynu- 
cleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA 
of any sequence, nucleic acid probes and primers. A nucleic acid molecule may also comprise modi- 
fied nucleic acid molecules, such as methylated nucleic acid molecules and nucleic acid molecule 
analogs. Analogs of purines and pyrimidines are known in the art, and include, but are not limited to, 

30 aziridinycytosine, 4-acetylcytosine, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2- 
thiouracil, 5-carboxymethyl-aminomethyluracil, inosine, N6-isopentenyladenine, 1-methyladenine, 1- 
methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, pseudouracil, 5-pentylnyluracil and 2,6- 
diaminopurine. The use of uracil as a substitute for thymine in a deoxyribonucleic acid is also 

35 considered an analogous form of pyrimidine. 
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A "fragment" of a polynucleotide is a polynucleotide comprised of at least 9 contiguous 
nucleotides, preferably at least 15 contiguous nucleotides and more preferably at least 45 nucleotides, 
of coding or non-coding sequences. 

The term "gene targeting" refers to a type of homologous recombination that occurs when a 
5 fragment of genomic DNA is introduced into a mammalian cell and that fragment locates and 
recombines with endogenous homologous sequences. 

The term "homologous recombination" refers to the exchange of DNA fragments between 
two DNA molecules or chromatids at the site of homologous nucleotide sequences. 

The term "homologous" as used herein denotes a characteristic of a DNA sequence having at 
10 least about 70 percent sequence identity as compared to a reference sequence, typically at least about 
85 percent sequence identity, preferably at least about 95 percent sequence identity, and more 
preferably about 98 percent sequence identity, and most preferably about 100 percent sequence 
identity as compared to a reference sequence. Homology can be determined using a "BLASTN" 
algorithm. It is understood that homologous sequences can accommodate insertions, deletions and 
15 substitutions in the nucleotide sequence. Thus, linear sequences of nucleotides can be essentially 
identical even if some of the nucleotide residues do not precisely correspond or align. The reference 
sequence may be a subset of a larger sequence, such as a portion of a gene or flanking sequence, or a 
repetitive portion of a chromosome. 

The term "target gene" (alternatively referred to as "target gene sequence" or 'target DNA 
20 sequence" or ''target sequence") refers to any nucleic acid molecule or polynucleotide of any gene to 
be modified by homologous recombination. The target sequence includes an intact gene, an exon or 
intron, a regulatory sequence or any region between genes. The target gene comprises a portion of a 
particular gene or genetic locus in the individual's genomic DNA. As provided herein, the target gene 
of the present invention is a GPCR gene. A "GPCR gene" refers to: a melanocortin-3 receptor gene, 
25 5-HT-2B gene, a chemokine receptor 9A gene, a glucocorticoid-induced receptor gene, an orphan 
GPR10 (UHR-1) gene, an orphan GPR14 gene, an orphan GPR15 gene, a beta chemokine receptor 
(E01) gene, an endothelial differentiation GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, or an 
adenosine 3 receptor gene. 

A "melanocortin-3 receptor gene" refers to a sequence comprising SEQ ID NO: 1 or 
30 comprising the sequence encoding the melanocortin-3 receptor [identified in Genbank as Accession 
No.: X74983; GI NO: 400473], In one aspect, the coding sequence of the melanocortin-3 receptor 
gene comprises SEQ ID NO:l or comprises the melanocortin-3 receptor gene identified in Genbank as 
Accession No.: X74983;GINO: 400473. 

A "5-HT-2B gene" refers to a sequence comprising SEQ ID NO:5 or comprising the sequence 
35 encoding the 5-HT-2B [identified in Genebank as Accession No.: Z151 19; GI NO: 54089]. In one 
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aspect, the coding sequence of the 5-HT-2B gene comprises SEQ ID NO:5 or comprises the 5-HT-2B 
gene identified in Genebank as Accession No.: Z151 19; GI NO: 54089. 

A "chemokine receptor 9A gene" refers to a sequence comprising SEQ ID NO:9 or 
comprising the sequence encoding the chemokine receptor 9A [identified in Genbank as Accession 
5 No.: U45982;GINO: 1245054]. In one aspect, the coding sequence of the chemokine receptor 9A 
gene comprises SEQ ID NO:9 or comprises the chemokine receptor 9A gene identified in Genbank as 
Accession No.: U45982;GINO: 1245054. 

A "glucocorticoid-induced receptor gene" refers to a sequence comprising SEQ ID NO: 1 or 
comprising the sequence encoding the glucocorticoid-induced receptor [identified in Genbank as 
10 Accession No.: M8048 1 ; GI NO: 1935 16] . In one aspect, the coding sequence of the glucocorticoid- 
induced receptor gene comprises SEQ ID NO:l or comprises the glucocorticoid-induced receptor 
gene identified in Genbank as Accession No.: M80481; GI NO: 193516. 

An "orphan GPR10 (UHR-1) gene" refers to a sequence comprising SEQ ID NO: 17 or 
comprising the sequence encoding the orphan GPR10 (UHR-1) [identified in Genbank as Accession 
15 No.: S77867; GI NO: 998527]. In one aspect, the coding sequence of the oiphan GPR10 (UHR-1) 
gene comprises SEQ ID NO: 17 or comprises the orphan GPR10 (UHR-1) gene identified in Genbank 
as Accession No.: S77867;GINO: 998527. 

An "orphan GPR14 gene" refers to a sequence comprising SEQ ID NO:21 or comprising the 
sequence encoding the orphan GPR14 [identified in Genbank as Accession No.: U32673; GI NO: 
20 1002742], In one aspect, the coding sequence of the orphan GPR14 gene comprises SEQ ID NO:21 
or comprises the orphan GPR14 gene identified in Genbank as Accession No.: U32673; GI NO: 
1002742. 

An "orphan GPR15 gene" refers to a sequence comprising SEQ ID NO:25 or comprising the 
sequence encoding the orphan GPR15 receptor [identified in Genbank as Accession No.: U34806; GI 
25 NO: 1 171 145]. In one aspect, the coding sequence of the orphan GPR15 gene comprises SEQ ID 
NO:25 or comprises the orphan GPR15 gene identified in Genbank as Accession No.: U34806; GI 
NO: 1171145. 

A "beta chemokine receptor (E01) gene" refers to a sequence comprising SEQ ID NO:29 or 
comprising the sequence encoding the beta chemokine receptor (E01) receptor [identified in Genbank 
30 as Accession No.: AF030185; GINO: 2623640]. In one aspect, the coding sequence of the beta 
chemokine receptor (E01) gene comprises SEQ ID NO:29 or comprises the beta chemokine receptor 
(E01) gene identified in Genbank as Accession No.: AF030185; GI NO: 2623640. 

A "endothelial differentiation GPCR 3 (EDG3) gene" refers to a sequence comprising SEQ 
ED NO: 33 or comprising the sequence encoding the endothelial differentiation GPCR 3 (EDG3) 
35 [identified in Genbank as Accession No.: NM010101; GINO: 6753715]. In one aspect, the coding 
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sequence of the endothelial differentiation GPCR 3 (EDG3) gene comprises SEQ ID NO:33 or 
comprises the endothelial differentiation GPCR 3 (EDG3) gene identified in Genbank as Accession 
No.: NM010101;GINO: 6753715. 

An "ATP receptor P2Ulgene" refers to a sequence comprising SEQ ID NO:37 or comprising 
5 the sequence encoding the ATP receptor P2U1 [identified in Genbank as Accession No.: L14751; GI 
NO: 309457], In one aspect, the coding sequence of the ATP receptor P2U1 gene comprises SEQ ID 
NO:37 or comprises the ATP receptor P2U1 identified in Genbank as Accession No.: L14751; GI 
NO: 309457. 

An "adenosine 3 receptor gene" refers to a sequence comprising SEQ ID NO:41 or 
10 comprising the sequence encoding the adenosine 3 receptor [identified in Genbank as Accession No.: 
L20331; GI NO: 438796]. In one aspect, the coding sequence of the adenosine 3 receptor gene 
comprises SEQ ID NO:41 or comprises the adenosine 3 receptor identified in Genbank as Accession 
No.: L20331;GINO: 438796. 

"Disruption" of a GPCR gene occurs when a fragment of genomic DNA locates and 
15 recombines with an endogenous homologous sequence. These sequence disruptions or modifications 
may include insertions, missense, frameshift, deletion, or substitutions, or replacements of DNA 
sequence, or any combination thereof. Insertions include the insertion of entire genes, which may be 
of animal, plant, fungal, insect, prokaryotic, or viral origin. Disruption, for example, can alter or 
replace a promoter, enhancer, or splice site of a GPCR gene, and can alter the normal gene product by 
20 inhibiting its production partially or completely or by enhancing the normal gene product's activity. 

The term, "transgenic cell", refers to a cell containing within its genome a GPCR gene that 
has been disrupted, modified, altered, or replaced completely or partially by the method of gene 
targeting. 

The term "transgenic animal" refers to an animal that contains within its genome a specific 
25 gene that has been disrupted by the method of gene targeting. The transgenic animal includes both the 
heterozygote animal (i.e., one defective allele and one wild-type allele) and the homozygous animal 
two defective alleles). 

As used herein, the terms "selectable marker" or "positive selection marker" refers to a gene 
encoding a product that enables only the cells that carry the gene to survive and/or grow under certain 
30 conditions. For example, plant and animal cells that express the introduced neomycin resistance 
(Ne<f) gene are resistant to the compound G418. Cells that do not carry the Ned gene marker are 
killed by G418. Other positive selection markers will be known to those of skill in the art 

A "host cell" includes an individual cell or cell culture that can be or has been a recipient for 
vectors) or for incorporation of nucleic acid molecules and/or proteins. Host cells include progeny of 
35 a single host cell, and the progeny may not necessarily be completely identical (in morphology or in 
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total DNA complement) to the original parent due to natural, accidental, or deliberate mutation. A 
host cell includes cells transfected with the constructs of the present invention. 

The term "modulates" as used herein refers to the inhibition, reduction, increase or 
enhancement of a GPCR function, expression, activity, or alternatively a phenotype associated with a 
5 disruption in a GPCR gene. 

The term "ameliorates" refers to a decreasing, reducing or eliminating of a condition, disease, 
disorder, or phenotype, including an abnormality or symptom associated with a disruption in a GPCR 
gene. 

The term "abnormality" refers to any disease, disorder, condition, or phenotype in which a 
10 disruption of a GPCR gene is implicated, including pathological conditions. 

Brief Description of the Drawings 
Figure 1 shows the polynucleotide sequence for a GPCR (melanocortin-3 receptor gene; SEQ 
ID NO:l). Figure 1 also shows the amino acid sequence for the melanocortin-3 receptor gene (SEQ 
IDNO:2). 

15 Figure 2 (Panels A and B) shows design of the targeting construct used to disrupt the 

melanocortin-3 receptor gene. Figure 2 (Panel B) shows the sequences identified as SEQ ID NO:3 
and SEQ ID NO:4, which were used as the targeting arms (homologous sequences) in the 
melanocortin-3 receptor gene targeting construct. 

Figure 3 shows the polynucleotide sequence for a 5-HT-2B (SEQ ID NO: 5). Figure 3 also 
20 shows the amino acid sequence for the 5-HT-2B (SEQ ID NO:6). 

Figure 4 (Panels A and B) shows design of the targeting construct used to disrupt 5-HT-2B 
genes. Figure 4 (Panel B) shows the sequences identified as SEQ ID NO:7 and SEQ ID NO:8, which 
were used as the targeting arms (homologous sequences) in the 5-HT-2B targeting construct. 

Figure 5 shows the polynucleotide sequence for an chemokine receptor 9A (SEQ ID NO:9). 
25 Figure 5 also shows the amino acid sequence for the chemokine receptor 9A (SEQ ID NO: 10). 

Figure 6 (Panels A and B) shows design of the targeting construct used to disrupt chemokine 
receptor 9A genes. Figure 6 (Panel B) shows the sequences identified as SEQ ID NO: 1 1 and SEQ ID 
NO: 12, which were used as the targeting arms (homologous sequences) in the chemokine receptor 9A 
targeting construct. 

30 Figure 7 shows the polynucleotide sequence for an glucocorticoid-induced receptor (SEQ ID 

NO: 13). Figure 7 also shows the amino acid sequence for the glucocorticoid-induced receptor (SEQ 
ID NO: 14). 

Figure 8 (Panels A and B) shows design of the targeting construct used to disrupt 
glucocorticoid-induced receptor genes. Figure 8 (Panel B) shows the sequences identified as SEQ ID 
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NO: 15 and SEQ ID NO: 16, which were used as the targeting arms (homologous sequences) in the 
glucocorticoid-induced receptor targeting construct. 

Figure 9 shows the polynucleotide sequence for a orphan GPR10 (UHR-1); SEQ ID NO: 17). 
Figure 9 also shows the amino acid sequence for the orphan GPR10 (UHR-1) gene (SEQ ID NO: 18). 
5 Figure 10 (Panels A and B) shows design of the targeting construct used to disrupt the orphan 

GPR10 (UHR-1) gene. Figure 10 (Panel B) shows the sequences identified as SEQ ID NO: 19 and 
SEQ ID NO:20, which were used as the targeting arms (homologous sequences) in the orphan 
GPR10 (UHR-1) gene targeting construct. 

Figure 11 shows the polynucleotide sequence for a GPCR (orphan GPR14 gene; SEQ ID 
10 NO:21). Figure 11 also shows the amino acid sequence for the orphan GPR14 gene (SEQ ID NO:22). 

Figure 12 (Panels A and B) shows design of the targeting construct used to disrupt the orphan 
GPR14 gene. Figure 12 (Panel B) shows the sequences identified as SEQ ID NO:23 and SEQ ID 
NO:24, which were used as the targeting arms (homologous sequences) in the orphan GPR14 gene 
targeting construct. . * 

15 Figure 13 shows the polynucleotide sequence for a GPCR (orphan GPR15 gene; SEQ ID 

NO:25). Figure 13 also shows the amino acid sequence for the orphan GPR15 gene (SEQ ID NO:26). 

Figure 14 (Panels A and B) shows design of the targeting construct used to disrupt the orphan 
GPR15 gene. Figure 14 (Panel B) shows the sequences identified as SEQ ID NO:27 and SEQ ID 
NO:28, which were used as the targeting arms (homologous sequences) in the orphan GPR15 gene 
20 targeting construct. 

Figure 15 shows the polynucleotide sequence for a GPCR (beta chemokine receptor (E01) 
gene; SEQ ID NO:29). Figure 15 also shows the amino acid sequence for the beta chemokine 
receptor (E01) gene (SEQ ID NO:30). 

Figure 16 (Panels A and B) shows design of the targeting construct used to disrupt the beta 
25 chemokine receptor (E01) gene. Figure 16 (Panel B) shows the sequences identified as SEQ ID 
NO:31 and SEQ ID NO:32, which were used as the targeting arms (homologous sequences) in the 
beta chemokine receptor (E01) gene targeting construct 

Figure 17 shows the polynucleotide sequence for a GPCR (endothelial differentiation GPCR 3 
(EDG3) gene; SEQ ID NO: 33). Figure 17 also shows the amino acid sequence for the endothelial 
30 differentiation GPCR 3 (EDG3) gene (SEQ ID NO:34). 

Figure 18 (Panels A and B) shows design of the targeting construct used to disrupt the 
endothelial differentiation GPCR 3 (EDG3) gene. Figure 18 (Panel B) shows the sequences identified 
as SEQ ID NO:35 and SEQ ID NO:36, which were used as the targeting aims (homologous 
sequences) in the endothelial differentiation GPCR 3 (EDG3) gene targeting construct 



WO 02/03793 PCTYUSO 1/2 1923 

-10- 

Figure 19 shows the polynucleotide sequence for a GPCR (ATP receptor P2U1 gene; SEQ ID 
NO: 1). Figure 19 also shows the amino acid sequence for the ATP receptor P2U1 gene (SEQ ID 
NO:38). 

Figure 20 (Panels A and B) shows design of the targeting construct used to disrupt the ATP 
5 receptor P2U1 gene. Figure 20 (Panel B) shows the sequences identified as SEQ ID NO:39 and SEQ 
ID NO:40, which were used as the targeting arms (homologous sequences) in the ATP receptor P2U1 
gene targeting construct 

Figure 21 shows the polynucleotide sequence for a GPCR (adenosine 3 receptor gene; SEQ 
ID NO:41). Figure 21 also shows the amino acid sequence for the adenosine 3 receptor gene (SEQ ID 
10 NO:42). 

Figure 22 (Panels A and B) shows design of the targeting construct used to disrupt the 
adenosine 3 receptor gene. Figure 22 (Panel B) shows the sequences identified as SEQ ID NO:43 and 
SEQ ID NO:44, which were used as the targeting arms (homologous sequences) in the adenosine 3 
receptor gene targeting construct. 
1 5 Detailed Description of the Invention 

The invention is based, in part, on the evaluation of the expression and role of genes and gene 
expression products, primarily those associated with a GPCR. Among others, the invention permits 
the definition of disease pathways and the identification of diagnostically and therapeutically useful 
targets. For example, genes that are mutated or down-regulated under disease conditions may be 
20 involved in causing or exacerbating the disease condition. Treatments directed at up-regulating the 
activity of such genes or treatments that involve alternate pathways, may ameliorate the disease 
condition. 

Generation of Targeting Construct 

The targeting construct of the present invention may be produced using standard methods 

25 known in the art. (see, e.g. , Sambrook, et al„ 1989, Molecular Cloning: A Laboratory Manual, 

Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York; E.N. Glover 
(eds.), 1985, DNA Cloning: A Practical Approach, Volumes I and II; MJ. Gait (ed), 1984, 
Oligonucleotide Synthesis; B.D. Hames & S J. ffiggins (eds.), 1985, Nucleic Acid Hybridization; B.D. 
Hames & SJ. Higgins (eds.), 1984, Transcription and Translation; R.I. Freshney (ed.), 1986, Animal 

30 Cell Culture; Immobilized Cells and Enzymes, KL Press, 1986; B. Perbal, 1984, A Practical Guide 
To Molecular Cloning; F.M. Ausubel et aL, 1994, Current Protocols in Molecular Biology, John 
Wiley & Sons, Inc.). For example, the targeting construct may be prepared in accordance with 
conventional ways, where sequences may be synthesized, isolated from natural sources, manipulated, 
cloned, ligated, subjected to in vitro mutagenesis, primer repair, or the like. At various stages, the 

35 joined sequences may be cloned, and analyzed by restriction analysis, sequencing, or the like. 
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The targeting DNA can be constructed using techniques well known in the art For example, 
the targeting DNA may be produced by chemical synthesis of oligonucleotides, nick-translation of a 
double-stranded DNA template, polymerase chain-reaction amplification of a sequence (or ligase 
chain reaction amplification), purification of prokaryotic or target cloning vectors harboring a 
5 sequence of interest (e.g. , a cloned cDNA or genomic DNA, synthetic DNA or from any of the afore- 
mentioned combination) such as plasmids, phagemids, YACs, cosmids, bacteriophage DNA, other 
viral DNA or replication intermediates, or purified restriction fragments thereof, as well as other 
sources of single and double-stranded polynucleotides having a desired nucleotide sequence. More- 
over, the length of homology may be selected using known methods in the art. For example, selection 
10 may be based on the sequence composition and complexity of the predetermined endogenous target 
DNA sequence(s). 

The targeting construct of the present invention typically comprises a first sequence homolo- 
gous to a portion or region of the GPCR gene and a second sequence homologous to a second portion 
or region of the GPCR gene. The targeting construct further comprises a positive selection marker, 

15 which is preferably positioned in between the first and the second DNA sequence that are homologous 
to a portion or region of the target DNA sequence. The positive selection marker may be operatively 
linked to a promoter and a polyadenylation signal. 

Other regulatory sequences known in the art may be incorporated into the targeting construct 
to disrupt or control expression of a particular gene in a specific cell type. In addition, the targeting 

20 construct may also include a sequence coding for a screening marker, for example, green fluorescent 
protein (GFP), or another modified fluorescent protein. 

Although the size of the homologous sequence is not critical and can range from as few as 50 
base pairs to as many as 100 kb, preferably each fragment is greater than about 1 kb in length, more 
preferably between about 1 and about 10 kb, and even more preferably between about 1 and about 5 

25 kb. One of skill in the art will recognize that although larger fragments may increase the number of 
homologous recombination events in ES cells, larger fragments will also be more difficult to clone. 

In a preferred embodiment of the present invention, the targeting construct is prepared 
directly from a plasmid genomic library using the methods described in pending U.S. Patent Applica- 
tion Serial No. 08/971,310, filed November 17, 1997, the disclosure of which is incorporated herein in 

30 its entirety. Generally, a sequence of interest is identified and isolated from a plasmid library in a 
single step using, for example, long-range PCR. Following isolation of this sequence, a second 
polynucleotide that will disrupt the target sequence can be readily inserted between two regions 
encoding the sequence of interest In accordance with this aspect, the construct is generated in two 
steps by (1) amplifying (for example, using long-range PCR) sequences homologous to the target 

35 sequence, and (2) inserting another polynucleotide (for example a selectable marker) into the PCR 
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product so that it is flanked by the homologous sequences. Typically, the vector is a plasmid from a 
plasmid genomic library. The completed construct is also typically a circular plasmid. 

In another embodiment, the targeting construct is designed in accordance with the regulated 
positive selection method described in U.S. Patent Application Ser. No. 60/232,957, filed September 
5 15, 2000, the disclosure of which is incorporated herein in its entirety. The targeting construct is 
designed to include a PGK-neo fusion gene having two lacO sites, positioned in the PGK promoter 
and an NLS-lacI gene comprising a lac repressor fused to sequences encoding the NLS from the 
SV40 T antigen. 

In another embodiment, the targeting construct may contain more than one selectable maker 
10 gene, including a negative selectable marker, such as the herpes simplex virus tk (HSV-tk) gene. The 
negative selectable marker may be operatively linked to a promoter and a polyadenylation signal. 
{see, e.g., U.S. Patent No. 5,464,764; U.S. Patent No. 5,487,992; U.S. Patent No. 5,627,059; and U.S. 
Patent No. 5,631,153). 

Generation of Cells and Confirmation of Homologous Recombination Events 

15 Once an appropriate targeting construct has been prepared, the targeting construct may be 

introduced into an appropriate host cell using any method known in the art. Various techniques may 
be employed in the present invention, including, for example, pronuclear microinjection; retrovirus 
mediated gene transfer into germ lines; gene targeting in embryonic stem cells; electroporation of 
embryos; sperm-mediated gene transfer; and calcium phosphate/DNA co-precipitates, microinjection 

20 of DNA into the nucleus, bacterial protoplast fusion with intact cells, transfection, polycations, e.g. , 
polybrene, polyornithine, etc., or the like (see, e.g., U.S. Patent No. 4,873,191; Van der Putten, et aL t 
1985, Proc. Natl Acad. ScL, USA 82:6148-6152; Thompson, etal., 1989, Cell 56:313-321; Lo, 1983, 
Mol Cell. Biol 3:1803-1814; Lavitrano, et al. t 1989, Cell, 57:717-723). Various techniques for trans- 
forming mammalian cells are known in the art (see, e.g., Gordon, 1989, Intl Rev. Cytol, 115:171- 

25 229; Keown et al. , 1989, Methods in Enzymology, Keown et al, 1990, Methods and Enzymology, Vol. 
185, pp. 527-537; Mansour et al., 1988, Nature, 336:348-352). 

In a preferred aspect of the present invention, the targeting construct is introduced into host 
cells by electroporation. In this process, electrical impulses of high field strength reversibly perme- 
abilize biomembranes allowing the introduction of the construct. The pores created during electro- 

30 poration permit the uptake of macromolecules such as DNA (see, e.g., Potter, H., et al., 1984, Proc. 
Nat'l Acad. ScL U.S.A. 81:7161-7165). 

Any cell type capable of homologous recombination may be used in the practice of the 
present invention. Examples of such target cells include cells derived from vertebrates including 
mammals such as humans, bovine species, ovine species, murine species, simian species, and ether 

35 eucaryotic organisms such as filamentous fungi, and higher multicellular organisms such as plants. 
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Preferred cell types include embryonic stem (ES) cells, which are typically obtained from pre- 
implantation embryos cultured in vitro (see, e.g., Evans, M. J., et aL 9 1981, Nature 292:154-156; 
Bradley, M. O., et al, 1984, Nature 309:255-258; Gossler et al, 1986, Proc. Natl Acad. Set USA 
83:9065-9069; and Robertson, et al, 1986, Nature 322:445-448). The ES cells are cultured and 
5 prepared for introduction of the targeting construct using methods well known to the skilled artisan 
(see, e.g., Robertson, E. J. ed. "Teratocarcinomas and Embryonic Stem Cells, a Practical Approach", 
IRL Press, Washington D.C., 1987; Bradley et al., 1986, Current Topics in Devel. Biol 20:357-371; 
by Hogan et al, in "Manipulating the Mouse Embryo": A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor N.Y., 1986; Thomas et al, 1987, Cell 51:503; Koller et al, 

10 1991, Proc. Natl. Acad. Sci. USA, 88:10730; Dorin et al, 1992, Traits genie Res. 1:101; and Veisef 
al., 1993, Cell 75:229). The ES cells that will be inserted with the targeting construct are derived 
from an embryo or blastocyst of the same species as the developing embryo into which they are to be 
introduced. ES cells are typically selected for their ability to integrate into the inner cell mass and 
contribute to the germ line of an individual when introduced into the mammal in an embryo at the 

15 blastocyst stage of development. Thus, any ES cell line having this capability is suitable for use in the 
practice of the present invention. 

The present invention may also be used to knockout genes in other cell types, such as stem 
cells. By way of example, stem cells may be myeloid, lymphoid, or neural progenitor and precursor 
cells. These cells comprising a disruption or knockout of a gene may be particularly useful in the 

20 study of GPCR gene function in individual developmental pathways. Stem cells may be derived from 
any vertebrate species, such as mouse, rat, dog, cat, pig, rabbit, human, non-human primates and the 
like. 

After the targeting construct has been introduced into cells, the cells where successful gene 
targeting has occurred are identified. Insertion of the targeting construct into the targeted gene is 

25 typically detected by identifying cells for expression of the marker gene. In a preferred embodiment, 
the cells transformed with the targeting construct of the present invention are subjected to treatment 
with an appropriate agent that selects against cells not expressing the selectable marker. Only those 
cells expressing the selectable marker gene survive and/or grow under certain conditions. For 
example, cells that express the introduced neomycin resistance gene are resistant to the compound 

30 G418, while cells that do not express the neo gene marker are killed by G418. If the targeting 
construct also comprises a screening marker such as GFP, homologous recombination can be 
identified through screening cell colonies under a fluorescent light. Cells that have undergone 
homologous recombination will have deleted the GEP gene and will not fluoresce. 

If a regulated positive selection method is used in identifying homologous recombination 

35 events, the targeting construct is designed so that the expression of the selectable marker gene is 
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regulated in a manner such that expression is inhibited following random integration but is permitted 
(dexepressed) following homologous recombination. More particularly, the transfected cells are 
screened for expression of the neo gene, which requires that (1) the cell was successfully electropo- 
rated, and (2) lac repressor inhibition of neo transcription was relieved by homologous recombination. 
5 This method allows for the identification of transfected cells and homologous recombinants to occur 
in one step with the addition of a single drug. 

Alternatively, a positive-negative selection technique may be used to select homologous 
recombinants. This technique involves a process in which a first drug is added to the cell population, 
for example, a neomycin-like drug to select for growth of transfected cells, Le. positive selection. A 

10 second drug, such as FIAU is subsequently added to kill cells that express the negative selection 

marker, Le. negative selection. Cells that contain and express the negative selection marker are killed 
by a selecting agent, whereas cells that do not contain and express the negative selection marker 
survive. For example, cells with non-homologous insertion of the construct express HSV thymidine 
kinase and therefore are sensitive to the herpes drugs such as gancyclovir (GANC) or FIAU (H2- 

15 deoxy 2-fluoro-B-D-arabinofluranosyl)-5-iodouracil) (see, e.g. , Mansour et al , Nature 336:348-352: 
(1988); Capecchi, Science 244:1288-1292, (1989); Capecchi, Trends in Genet. 5:70-76 (1989)). 

Successful recombination may be identified by analyzing the DNA of the selected cells to 
confirm homologous recombination. Various techniques known in the art, such as PCR and/or 
Southern analysis may be used to confirm homologous recombination events. 

20 Homologous recombination may also be used to disrupt genes in stem cells, and other cell 

types, which are not totipotent embryonic stem cells. By way of example, stem cells may be myeloid, 
lymphoid, or neural progenitor and precursor cells. Such transgenic cells may be particularly useful in 
the study of GPCR gene function in individual developmental pathways. Stem cells may be derived 
from any vertebrate species, such as mouse, rat, dog, cat, pig, rabbit, human, non-human primates and 

25 the like. 

In cells that are not totipotent it may be desirable to knock out both copies of the target using 
methods that are known in the art. For example, cells comprising homologous recombination at a 
target locus that have been selected for expression of a positive selection marker (e.g., Neo 1 ) and 
screened for non-random integration, can be further selected for multiple copies of the selectable 

30 marker gene by exposure to elevated levels of the selective agent (e.g., G418). The cells are then 
analyzed for homozygosity at the target locus. Alternatively, a second construct can be generated 
with a different positive selection marker inserted between the two homologous sequences. The two 
constructs can be introduced into the cell either sequentially or simultaneously, followed by appro- 
priate selection for each of the positive marker genes. The final cell is screened for homologous 

35 recombination of both alleles of the target 
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Production of Transgenic Animals 

Selected cells are then injected into a blastocyst (or other stage of development suitable for 
the purposes of creating a viable animal, such as, for example, a morula) of an animal (e.g., a mouse) 
to form chimeras (see e.g., Bradley, A. in Teratocarcinomas and Embryonic Stem Cells: A Practical 
5 Approach, E. J. Robertson, ed., IRL, Oxford, pp. 1 13-152 (1987)). Alternatively, selected ES cells 
can be allowed to aggregate with dissociated mouse embryo cells to form the aggregation chimera. A 
chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the 
embryo brought to term. Chimeric progeny harboring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the homologously 

10 recombined DNA. In one embodiment, chimeric progeny mice are used to generate a mouse with a 
heterozygous disruption in the GPCR gene. Heterozygous transgenic mice can then be mated. It is 
well know in the art that typically Va of the offspring of such matings will have a homozygous 
disruption in the GPCR gene. 

The heterozygous and homozygous transgenic mice can then be compared to normal, wild 

15 type mice to determine whether disruption of the GPCR gene causes phenotypic changes, especially 
pathological changes. For example, heterozygous and homozygous mice may be evaluated for 
phenotypic changes by physical examination, necropsy, histology, clinical chemistry, complete blood 
count, body weight, organ weights, and cytological evaluation of bone marrow. 

In one embodiment, the phenotype (or phenotypic change) associated with a disruption in the 

20 GPCR gene is placed into or stored in a database. Preferably, the database includes: (i) genotypic data 
(e.g., identification of the disrupted gene) and (ii) phenotypic data (e.g., phenotype(s) resulting from 
the gene disruption) associated with the genotypic data. The database is preferably electronic. In 
addition, the database is preferably combined with a search tool so that the database is searchable. 
Conditional Transgenic Animals 

25 The present invention further contemplates conditional transgenic or knockout animals, such 

as those produced using recombination methods. Bacteriophage PI Cre recombinase and flp 
recombinase from yeast plasmids are two non-limiting examples of site-specific DNA recombinase 
enzymes that cleave DNA at specific target sites (lox P sites for cre recombinase and fit sites for flp 
recombinase) and catalyze a ligation of this DNA to a second cleaved site. A large number of suitable 

30 alternative site-specific recombinases have been described, and their genes can be used in accordance 
with the method of the present invention. Such recombinases include the Int recombinase of 
bacteriophage X (with or without Xis) (Weisberg, R. et al, in Lambda II, (Hendrix, R., et a/., Eds.), 
Cold Spring Harbor Press, Cold Spring Harbor, NY, pp. 21 1-50 (1983), herein incorporated by 
reference); Tpnl and the p-lactamase transposons (Merrier, etaL,J. Bacteriol, 172:3745-57 (1990)); 

35 the Tn3 resolvase (Flanagan & Fennewald /. Molec. Biol, 206:295-304 (1989); Stark, et al, Cell, 
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58:779-90 (1989)); the yeast recombinases (Matsuzaki, et al, J. BacterioL, 172:610-18 (1990)); the B. 
subtilis SpoIVC recombinase (Sato, et al, J. BacterioL 172:1092-98 (1990)); the Hp recombinase 
(Schwartz & Sadowski, J. MolecMol, 205:647-658 (1989); Parsons, etal, J. Biol Client., 265:4527- 
33 (1990); Golic & Lindquist, Cell 59:499-509 (1989); Amin, etal., J. Molec. Biol, 214:55-72 
5 (1990)); the Hin recombinase (Glasgow, et al, J. Biol Chem., 264: 10072-82 (1989)); immunoglo- 
bulin recombinases (Malynn, et al, Cell, 54:453-460 (1988)); and the Cin recombinase (Haffter & 
Bickle, EMBO J., 7:3991-3996 (1988); Hubner, et aL, J. Molec. Biol, 205:493-500 (1989)), all herein 
incorporated by reference. Such systems are discussed by Echols (/. Biol Chem. 265:14697-14700 
(1990)); de Villartay (Nature, 335:170-74 (1988)); Craig, (Ann. Rev. Genet, 22:77-105 (1988)); 

10 Poyart-Salmeron, et al., (EMBO J. 8:2425-33 (1989)); Hunger-Bertling, et al.,(Mol Cell Biochem., 
92:107-16 (1990)); and Cregg & Madden (Mol. Gen. Genet., 219:320-23 (1989)), all herein 
incorporated by reference. 

Cre has been purified to homogeneity, and its reaction with the loxP site has been extensively 
characterized (Abremski & Hess /. Mol Biol 259:1509-14 (1984), herein incorporated by reference). 

15 Cre protein has a molecular weight of 35,000 and can be obtained commercially from New England 
Nuclear/DuPont. The cre gene (which encodes the Cre protein) has been cloned and expressed 
(Abremski, et al., Cell 32:1301-11 (1983), herein incorporated by reference). The Cre protein medi- 
ates recombination between two loxP sequences (Sternberg, et al., Cold Spring Harbor Symp. Quant. 
Biol 45:297-309 (1981)), which may be present on the same or different DNA molecule. Because the 

20 internal spacer sequence of the loxP site is asymmetrical, two loxP sites can exhibit directionality 
relative to one another (Hoess & Abremski Proc. Natl Acad. ScL U.S.A. 81:1026-29 (1984;). Thus, 
when two sites on the same DNA molecule are in a directly repeated orientation, Cre will excise the 
DNA between the sites (Abremski, et al, Cell 32:1301-11 (1983)). However, if the sites are inverted 
with respect to each other, the DNA between them is not excised after recombination but is simply 

25 inverted. Thus, a circular DNA molecule having two loxP sites in direct orientation will recombine to 
produce two smaller circles, whereas circular molecules having two loxP sites in an inverted orienta- 
tion simply invert the DNA sequences flanked by the loxP sites. In addition, recombinase action can 
result in reciprocal exchange of regions distal to the target site when targets are present on separate 
DNA molecules. 

30 Recombinases have important application for characterizing gene function in knockout 

models. When- the constructs described herein are used to disrupt GPCR genes, a fusion transcript can 
be produced when insertion of the positive selection marker occurs downstream (3*) of the translation 
initiation site of the GPCR gene. The fusion transcript could result in some level of protein expres- 
sion with unknown consequence. It has been suggested that insertion of a positive selection marker 

35 gene can affect the expression of nearby genes. These effects may make it difficult to determine gene 
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function after a knockout event since one could not discern whether a given phenotype is associated 
with the inactivation of a gene, or the transcription of nearby genes. Both potential problems are 
solved by exploiting recombinase activity. When the positive selection marker is flanked by recombi- 
nase sites in the same orientation, the addition of the corresponding recombinase wDl result in the 
5 removal of the positive selection marker. In this way, effects caused by the positive selection marker 
or expression of fusion transcripts are avoided. 

In one embodiment, purified recombinase enzyme is provided to the cell by direct 
microinjection. In another embodiment, recombinase is expressed from a co-transfected construct or 
vector in which the recombinase gene is operably linked to a functional promoter. An additional 

10 aspect of this embodiment is the use of tissue-specific or inducible recombinase constructs that allow 
the choice of when and where recombination occurs. One method for practicing the inducible forms 
of recombinase-mediated recombination involves the use of vectors that use inducible or tissue- 
specific promoters or other gene regulatory elements to express the desired recombinase activity. The 
inducible expression elements are preferably operatively positioned to allow the inducible control or 

15 activation of expression of the desired recombinase activity. Examples of such inducible promoters or 
other gene regulatory elements include, but are not limited to, tetracycline, metallothionine, ecdysone, 
and other steroid-responsive promoters, rapamycin responsive promoters, and the like (No, et al. t 
Proc. Natl Acad. ScL USA, 93:3346-51 (1996); Furth, et al f Proc. Natl Acad Set USA, 91:9302-6 
(1994)). Additional control elements that can be used include promoters requiring specific 

20 transcription factors such as viral, promoters. Vectors incorporating such promoters would only 
express recombinase activity in cells that express the necessary transcription factors. 
Models for Disease 

The cell- and animal-based systems described herein can be utilized as models for diseases. 
Animals of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 

25 goats, and non-human primates, e.g., baboons, monkeys, and chimpanzees may be used to generate 
disease animal models. In addition, cells from humans may be used. These systems may be used in a 
variety of applications. Such assays may be utilized as part of screening strategies designed to identi- 
fy agents, such as compounds that are capable of ameliorating disease symptoms. Thus, the animal- 
and cell-based models may be used to identify drugs, pharmaceuticals, therapies and interventions that 

30 may be effective in treating disease. 

Cell-based systems may be used to identify compounds that may act to ameliorate disease 
symptoms. For example, such cell systems may be exposed to a compound suspected of exhibiting an 
ability to ameliorate disease symptoms, at a sufficient concentration and for a time sufficient to elicit 
such an amelioration of disease symptoms in the exposed cells. After exposure, the cells are 
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examined to determine whether one or more of the disease cellular phenotypes has been altered to 
resemble a more normal or more wild type, non-disease phenotype. 

In addition, animal-based disease systems, such as those described herein, may be used to 
identify compounds capable of ameliorating disease symptoms. Such animal models may be used as 
5 test substrates for the identification of drugs, pharmaceuticals, therapies, and interventions that may be 
effective in treating a disease or other phenotypic characteristic of the animal. For example, animal 
models may be exposed to a compound or agent suspected of exhibiting an ability to ameliorate 
disease symptoms, at a sufficient concentration and for a time sufficient to elicit such an amelioration 
of disease symptoms in the exposed animals. The response of the animals to the exposure may be 

10 monitored by assessing the reversal of disorders associated with the disease. Exposure may involve 
treating mother animals during gestation of the model animals described herein, thereby exposing 
embryos or fetuses to the compound or agent that may prevent or ameliorate the disease or phenotype. 
Neonatal, juvenile, and adult animals can also be exposed. 

More particularly, using the animal models of the invention, specifically, transgenic mice, 

15 methods of identifying agents, including compounds are provided, preferably, on the basis of the 

ability to affect at least one phenotype associated with a disruption in a GPCR gene. In one embodi- 
ment, the present invention provides a method of identifying agents having an effect on GPCR 
expression or function. The method includes measuring a physiological response of the animal, for 
example, to the agent, and comparing the physiological response of such animal to a control animal, 

20 wherein the physiological response of the animal comprising a disruption in a GPCR as compared to 
the control animal indicates the specificity of the agent. A "physiological response" is any biological 
or physical parameter of an animal that can be measured. Molecular assays (e.g., gene transcription, 
protein production and degradation rates), physical parameters (e.g., exercise physiology tests, 
measurement of various parameters of respiration, measurement of heart rate or blood pressure, 

25 measurement of bleeding time, aPTT.T, or TT), and cellular assays (e.g.,. immunohistochemical 

assays of cell surface markers, or the ability of cells to aggregate or proliferate) can be used to assess a 
physiological response. 

The transgenic animals and cells of the present invention may be utilized as models for 
diseases, disorders, or conditions associated with phenotypes relating to a disruption in a GPCR. The 

30 present invention also provides a unique animal model for testing and developing new treatments re- 
lating to the behavioral phenotypes. Analysis of the behavioral phenotype allows for the development 
of an animal model useful for testing, for instance, the efficacy of proposed genetic and pharmacolo- 
gical therapies for human genetic diseases, such as neurological, neuropsychological, or psychotic 
illnesses. 
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A statistical analysis of the various behaviors measured can be carried out using any conven- 
tional statistical program routinely used by those skilled in the art (such as, for example, "Analysis of 
Variance" or ANOVA). A "p" va l ue of about 0.05 or less is generally considered to be statistically 
significant, although slightly higher p values may still be indicative of statistically significant 
5 differences. To statistically analyze abnormal behavior, a comparison is made between the behavior of 
a transgenic ani m al (or a group thereof) to the behavior of a wild-type mouse (or a group thereof), 
typically under certain prescribed conditions. "Abnormal behavior" as used herein refers to behavior 
exhibited by an animal having a disruption in the GPCR gene, e.g. transgenic animal, which differs 
from an animal without a disruption in the GPCR gene, e.g. wild-type mouse. Abnormal behavior 

10 consists of any number of standard behaviors that can be objectively measured (or observed) and 
compared. In the case of comparison, it is preferred that the change be statistically significant to 
confirm that there is indeed a meaningful behavioral difference between the knockout animal and the 
wild-type control animal. Examples of behaviors that may be measured or observed include, but are 
not limited to, ataxia, rapid limb movement, eye movement, breathing, motor activity, cognition, 

15 emotional behaviors, social behaviors, hyperactivity, hypersensitivity, anxiety, impaired learning, 
abnormal reward behavior, and abnormal social interaction, such as aggression, 

A series of tests may be used to measure the behavioral phenotype of the animal models of the 
present invention, including neurological and neuropsychological tests to identify abnormal behavior. 
These tests may be used to measure abnormal behavior relating to, for example, learning and memory, 

20 eating, pain, aggression, sexual reproduction, anxiety, depression, schizophrenia, and drug abuse. 
(see, e.g., Crawley & Paylor, Hormones and Behavior 31:197-211 (1997)). 

The social interaction test involves exposing a mouse to other animals in a variety of settings. 
The social behaviors of the animals (e.g., touching, climbing, sniffing, and mating) are subsequently 
evaluated. Differences in behaviors can then be statistically analyzed and compared (see, e.g., S. E. 

25 File, et al., Pharmacol. Bioch. Behav. 22:941-944 (1985); R. R. Holson, Phys. Behav. 37:239-247 
(1986)): Examplary behavioral tests include the following. 

The mouse startle response test typically involves exposing the animal to a sensory (typically 
auditory) stimulus and measuring the startle response of the animal (see, e.g., M. A. Geyer, et al., 
Brain Res. Bull 25:485-498 (1990); Paylor and Crawley, Psychopharmacology 132:169-180 (1997)). 

30 A pre-pulse inhibition test can also be used, in which the percent inhibition (from a normal startle 
response) is measured by "cueing" the animal first with a brief low-intensity pre-pulse prior to the 
startle pulse. 

The electric shock test generally involves exposure to an electrified surface and measurement 
of subsequentbehaviors such as, for example, motor activity, learning, social behaviors. The behavi- . 
35 ors are measured and statistically analyzed using standard statistical tests, (see, e.g., G. J. Kant, et aL, 
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Pharm. Biock Behav. 20:793-797 (1984); N. J. Leidenheimer, et al, Pharmacol Bioch. Behav. 
30:351-355 (1988)). 

The tail-pinch or immobilization test involves applying pressure to the tail of the animal 
and/or restraining the animal's movements. Motor activity, social behavior, and cognitive behavior 
5 are examples of the areas that are measured, (see, e.g., M. Bertolucci D'Angic, et al, Neurochem. 
55:1208-1214(1990)). 

The novelty test generally comprises exposure to a novel environment and/or novel objects. 
The animal's motor behavior in the novel environment and/or around the novel object are measured 
and statistically analyzed, (see, e.g., D. K. Reinstein, et al, Pharm. Biock Behav. 17:193-202 (1982); 
10 B. Poucet, Behav. NeuroscL 103:1009-10016 (1989); R. R. Holson, et al, Phys. Behav. 37:231-238 
(1986)). This test may be used to detect visual processing deficiencies or defects. 

The learned helplessness test involves exposure to stresses, for example, noxious stimuli, 
which cannot be affected by the animal's behavior. The animal's behavior can be statistically analyzed 
using various standard statistical tests, (see, e.g., A. Leshner, et al, Behav. Neural Biol. 26:497-501 
15 (1979)). 

Alternatively, a tail suspension test may be used, in which the "immobile" time of the mouse 
is measured when suspended "upside-down" by its tail. This is a measure of whether the animal 
struggles, an indicator of depression. In humans, depression is believed to result from feelings of a 
lack of control over one's life or situation. It is believed that a depressive state can be elicited in 

20 animals by repeatedly subjecting them to aversive situations over which they have no control. A 
condition of "learned helplessness" is eventually reached, in which the animal will stop trying to 
change its circumstances and simply accept its fate. Animals that stop struggling sooner are believed 
to be more prone to depression. Studies have shown that the administration of certain antidepressant 
drugs prior to testing increases the amount of time that animals struggle before giving up. 

25 The Morris water-maze test comprises learning spatial orientations in water and subsequentiy 

measuring the animal's behaviors, such as, for example, by counting the number of incorrect choices. 
The behaviors measured are statistically analyzed using standard statistical tests, (see, e.g., E. M. 
Spruijt, et aL f Brain Res. 527:192-197 (1990)). 

Alternatively, a Y-shaped maze may be used (see, e.g., McFarland, D J., Pharmacology, 

30 Biochemistry and Behavior 32:723-726 (1989); Delhi, R, et al. , Neurobiology of Learning and 

Memory 73:31-48 (2000)). The Y-maze is generally believed to be a test of cognitive ability. The 
dimensions of each arm of the Y-maze can be, for example, approximately 40 cm x 8 cm x 20 cm, 
although other dimensions may be used. Each arm can also have, for example, sixteen equally spaced 
photobeams to automatically detect movement within the arms. At least two different tests can be 

35 performed using such a Y-maze. In a continuous Y-maze paradigm, mice are allowed to explore all 
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three arms of a Y-maze for, e.g., approximately 10 minutes. The animals are continuously tracked 
using photobeam detection grids, and the data can be used to measure spontaneous alteration and 
positive bias behavior. Spontaneous alteration refers to the natural tendency of a "normal" animal to 
visit the least familiar arm of a maze. An alternation is scored when the animal makes two consecu- 
5 tive turns in the same direction, thus representing a sequence of visits to the least recently entered arm 
of the maze. Position bias determines egocentrically defined responses by measuring the animal's 
tendency to favor turning in one direction over another. Therefore, the test can detect differences in 
an animal's ability to navigate on the basis of allocentric or egocentric mechanisms. The two-trial Y- 
maze memory test measures response to novelty and spatial memory based on a free-choice 

10 exploration paradigm. During the first trial (acquisition), the animals are allowed to freely visit two 
arms of the Y-maze for, e.g., approximately 15 minutes. The third arm is blocked off during this trial. 
The second trial (retrieval) is performed after an intertrial interval of, e.g., approximately 2 hours. 
During the retrieval trial, the blocked arm is opened and the animal is allowed access to all three arms 
for, e.g., approximately 5 minutes. Data are collected during the retrieval trial and analyzed for the 

15 number and duration of visits to each arm. Because the three arms of the maze are virtually identical, 
discrimination between novelty and familiarity is dependent on "environmental" spatial cues around 
the room relative to the position of each arm. Changes in arm entry and duration of time spent in the 
novel arm in a transgenic animal model may be indicative of a role of that gene in mediating novelty 
and recognition processes. 

20 The passive avoidance or shuttle box test generally involves exposure to two or more 

environments, one of whiph is noxious, providing a choice to be learned by the animal. Behavioral 
measures include, for example, response latency, number of correct responses, and consistency of 
response, (see, e.g., R. Ader, etal, Psychon. Sci. 26:125-128 (1972); R. R. Holson, Phys. Behav. 
37:221-230 (1986)). Alternatively, a zero-maze can be used. In a zero-maze, the animals can, for 

25 example, be placed in a closed quadrant of an elevated annular platform having, e.g., 2 open and 2 
closed quadrants, and are allowed to explore for approximately 5 minutes. This paradigm exploits an 
approach-avoidance conflict between normal exploratory activity and an aversion to open spaces in 
rodents. This test measures anxiety levels and can be used to evaluate the effectiveness of anti- 
anxiolytic drugs. The time spent in open quadrants versus closed quadrants may be recorded 

30 automatically, with, for example, the placement of photobeams at each transition site. 

The food avoidance test involves exposure to novel food and objectively measuring, for 
example, food intake and intake latency. The behaviors measured are statistically analyzed using 
standard statistical tests, (see, e.g., B. A. Campbell, et aL, J. Comp. Physiol. Psychol 67:15-22 
(1969)), 
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The elevated plus-maze test comprises exposure to a maze, without sides, on a platform, the 
animal's behavior is objectively measured by counting the number of maze entries and maze learning. 
The behavior is statistically analyzed using standard statistical tests, (see, e.g., H. A. Baldwin, et al, 
Brain Res. Bull, 20:603-606 (1988)). 
5 The stimulant-induced hyperactivity test involves injection of stimulant drugs (e.g., 

amphetamines, cocaine, POP, and the like), and objectively measuring, for example, motor activity, 
social interactions, cognitive behavior. The animal's behaviors are statistically analyzed using 
standard statistical tests, (see, e.g., P. B. S. Clarke, etal, Psychopharmacology 96:511-520 (1988); P. 
Kuczenski, et al, J. Neuroscience 11:2703-2712 (1991)). 
10 The self-stimulation test generally comprises providing the mouse with the opportunity to . 

regulate electrical and/or chemical stimuli to its own brain. Behavior is measured by frequency and 
pattern of self-stimulation. Such behaviors are statistically analyzed using standard statistical tests. 
(see, e.g., S. Nassif, et al, Brain Res., 332:247-257 (1985); W. L. Isaac, et al., Behav. Neurosci. 
103:345-355 (1989)). 

15 The reward test involves shaping a variety of behaviors, e.g., motor, cognitive, and social, 

measuring, for example, rapidity and reliability of behavioral change, and statistically analyzing the 
behaviors measured, (see, e.g., L. E. Jarrard, et al, Exp. Brain Res. 61:519-530 (1986)). 

The DRL (differential reinforcement to low rates of responding) performance test involves 
exposure to intermittent reward paradigms and measuring the number of proper responses, e.g., lever 

20 pressing. Such behavior is statistically analyzed using standard statistical tests, (see, e.g., J. D. Sinden, 
et al, Behav. Neurosci. 100:320-329 (1986); V. Nalwa, et al., Behav Brain Res. 17:73-76 (1985); and 
A. J. Nonneman, et al., J. Comp. Physiol. Psych. 95:588-602 (1981)). 

The spatial learning test involves exposure to a complex novel environment, measuring the 
rapidity and extent of spatial learning, and statistically analyzing the behaviors measured, (see, e.g., N. 

25 Pitsikas, et al, Pharm. Bioch. Behav. 38:931-934 (1991); B. Poucet, et al., Brain Res. 37:269-280 
(1990); D. Christie, et al., Brain Res. 37:263-268 (1990); and F. Van Haaren, et al., Behav. Neurosci. 
102:481-488 (1988)). Alternatively, an open-field (of) test may be used, in which the greater distance 
traveled for a given amount of time is a measure of the activity level and anxiety of the animal. When 
the open field is a novel environment, it is believed that an approach-avoidance situation is created, in 

30 which the animal is 'torn" between the drive to explore and the drive to protect itself. Because the 
chamber is lighted and has no places to hide other than the corners, it is expected that a "normal" 
mouse will spend more time in the corners and around the periphery than it will in the center where 
there is no place to hide. "Normal" mice will, however, venture into the central regions as they 
explore more and more of the chamber. It can then be extrapolated that especially anxious mice will 

35 spend most of their time in the corners, with relatively little or no exploration of the central region, 
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whereas bold (i.e. y less anxious) mice will travel a greater distance, showing little preference for the 
periphery versus the central region. 

The visual, somatosensory and auditory neglect tests generally comprise exposure to a 
sensory stimulus, objectively measuring, for example, orientating responses, and statistically 
5 analyzing the behaviors measured, {see, e.g., J. M. Vargo, et al., Exp. Neurol. 102: 199-209 (1988)). 

The consummatory behavior test generally comprises feeding and drinking, and objectively 
measuring quantity of consumption. The behavior measured is statistically analyzed using standard 
statistical tests, (see, e.g., P. J. Fletcher, etal, Psychopharmaco). 102:301-308 (1990); M. G. Corda, et 
al.„ Proc. Natl Acad. Sci. USA 80:2072-2076 (1983)). 

10 A visual discrimination test can also be used to evaluate the visual processing of an animal. 

One or two similar objects are placed in an open field and the animal is allowed to explore for about 
5-10 minutes. The time spent exploring each object (proximity to, i.e., movement within, e.g., about 
3-5 cm of the object is considered exploration of an object) is recorded. The animal is then removed 
from the open field, and the objects are replaced by a similar object and a novel object. The animal is 

15 returned to the open field and the percent time spent exploring the novel object over the old object is 
measured (again, over about a 5-10 minute span). "Normal" animals will typically spend a higher 
percentage of time exploring the novel object rather than the old object. If a delay is imposed between 
sampling and testing, the memory task becomes more hippocampal-dependent. If no delay is 
imposed, the task is more based on simple visual discrimination. This test can also be used for 

20 olfactory discrimination, in which the objects (preferably, simple blocks) can be sprayed or otherwise 
treated to hold an odor. This test can also be used to determine if the animal can make gustatory 
discriminations; animals that return to the previously eaten food instead of novel food exhibit 
gustatory neophobia. 

A hot plate analgesia test can be used to evaluate an animal's sensitivity to heat or painful 
25 stimuli. For example, a mouse can be placed on an approximately 55°C hot plate and the mouse's 
response latency (e.g., time to pick up and lick a hind paw) can be recorded These responses are not 
reflexes, but rather "higher" responses requiring cortical involvement. This test may be used to 
evaluate a nociceptive disorder. 

An accelerating rotarod test may be used to measure coordination and balance in mice. 
30 Animals can be, for example, placed on a rod that acts like a rotating treadmill (or rolling log). The 
rotarod can be made to rotate slowly at first and then progressively faster until it reaches a speed of, 
e.g., approximately 60 rpra The mice must continually reposition themselves in order to avoid falling 
off. The animals are preferably tested in at least three trials, a minimum of 20 minutes apart Those 
mice that are able to stay on the rod the longest are believed to have better coordination and balance. 
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A metrazol administration test can be used to screen animals for varying susceptibilities to 
seizures or similar events. For example, a 5mg/ml solution of metrazol can be infused through the tail 
vein of a mouse at a rate of, e.g. , approximately 0.375 ml/min. The infusion will cause all mice to 
experience seizures, followed by death. Those mice that enter the seizure stage the soonest are 
5 believed to be more prone to seizures. Four distinct physiological stages can be recorded: soon after 
the start of infusion, the mice will exhibit a noticeable "twitch", followed by a series of seizures, 
ending in a final tensing of the body known as "tonic extension", which is followed by death. 
GPCR Gene Products 

The present invention further contemplates use of the GPCR gene sequence to produce GPCR 

10 gene products. GPCR gene products may include proteins that represent functionally equivalent gene 
products. Such an equivalent gene product may contain deletions, additions or substitutions of amino 
acid residues within the amino acid sequence encoded by the gene sequences described herein, but 
which result in a silent change, thus producing a functionally equivalent GPCR gene product. Amino 
acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydropho- 

15 bicity, hydrophilicity, and/or the amphipathic nature of the residues involved. 

For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, 
threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids 
include arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic 

20 . acid and glutamic acid. "Functionally equivalent", as utilized herein, refers to a protein capable of 
exhibiting a substantially similar in vivo activity as the endogenous gene products encoded by the 
GPCR gene sequences. Alternatively, when utilized as part of an assay, "functionally equivalent" 
may refer to peptides capable of interacting with other cellular or extracellular molecules in a manner 
substantially similar to the way in which the corresponding portion of the endogenous gene product 

25 would. 

Other protein products useful according to the methods of the invention are peptides derived 
from or based on the GPCR gene produced by recombinant or synthetic means (derived peptides). 

GPCR gene products may be produced by recombinant DNA technology using techniques 
well known in the art Thus, methods for preparing the gene polypeptides and peptides of the inven- 

30 tion by expressing nucleic acid encoding gene sequences are described herein. Methods that are well 
known to those skilled in the art can be used to construct expression vectors containing gene protein 
coding sequences and appropriate transcriptional/translational control signals. These methods 
include, for example, in vitro recombinant DNA techniques, synthetic techniques and in vivo 
recombination/genetic recombination (see, e.g., Sambrook, et aL 9 1989, supra, and Ausubel, et aL, 

35 1989, supra). Alternatively, RNA capable of encoding gene protein sequences may be chemically 
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synthesized using, for example, automated synthesizers {see, e.g. Oligonucleotide Synthesis: A 
Practical Approach, Gait, M. J. ed., IRL Press, Oxford (1984)). 

A variety of host-expression vector systems may be utilized to express the gene coding 
sequences of the invention. Such host-expression systems represent vehicles by which the coding 
5 sequences of interest may be produced and subsequently purified, but also represent cells that may, 
when transformed or transfected with the appropriate nucleotide coding sequences, exhibit the gene 
protein of the invention in situ. These include but are not limited to microorganisms such as bacteria 
(e.g., E. coli, B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors containing gene protein coding sequences; yeast (e.g. Saccharomyces, 

10 Pichia) transformed with recombinant yeast expression vectors containing the gene protein coding 
sequences; insect cell systems infected with recombinant virus expression vectors (e.g., baculovirus) 
containing the gene protein coding sequences; plant cell systems infected with recombinant virus 
expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) containing gene protein 

15 coding sequences; or mammalian cell systems (e.g. COS, CHO, BHK, 293, 3T3) harboring recombi- 
nant expression constructs containing promoters derived from the genome of mammalian cells (e.g., 
metallothionine promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7.5 K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 

20 depending upon the use intended for the gene protein being expressed. For example, when a large 
quantity of such a protein is to be produced, for the generation of antibodies or to screen peptide 
libraries, for example, vectors that direct the expression of high levels of fusion protein products that 
are readily purified may be desirable. Such vectors include, but are not limited, to the E. coli 
expression vector pUR278 (Ruther et al., EMBO J., 2:1791-94 (1983)), in which the gene protein 

25 coding sequence may be ligated individually into the vector in frame with the lac Z coding region so 
that a fusion protein is produced; pIN vectors (Inouye & Laouye, Nucleic Acids Res., 13:3101-09 
(1985); Van Heeke et al, J. Biol. Chem., 264:5503-9 (1989)); and the like. pGEX vectors may also be 
used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In 
general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to 

30 glutathione-agarose beads followed by elution in the presence of free glutathione. The pGEX vectors 
are designed to include thrombin or factor Xa protease cleavage sites so that the cloned GPCR gene 
protein can be released from the GST moiety. 

In a preferred embodiment, full length cDNA sequences are appended with in-frame Bam HI 
sites at the amino terminus and Eco RI sites at the carboxyl terminus using standard PCR methodolo- 

35 gies (Innis, et al. (eds) PCR Protocols: A Guide to Methods and Applications, Academic Press, San 
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Diego (1990)) and ligated into the pGEX-2TK vector (Pharmacia, Uppsala, Sweden). The resulting 
cDNA construct contains a kinase recognition site at the amino terminus for radioactive labeling and 
glutathione S-transferase sequences at the carboxyl terminus for affinity purification (Nilsson, et al., 
EMBO J., 4: 1075-80 (1985); Zabeau et al, EMBO J., 1: 1217-24 (1982)). 
5 In an insect system, Autographa calif ornica nuclear polyhidrosis virus (AcNPV) is used as a 

vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. The gene coding 
sequence may be cloned individually into non-essential regions (for example the polyhedrin gene) of 
the virus and placed under control of an AcNPV promoter (for example the polyhedrin promoter). 
Successful insertion of gene coding sequence will result in inactivation of the polyhedrin gene and 

10 production of non-occluded recombinant virus (ie. , virus lacking the proteinaceous coat coded for by 
the polyhedrin gene). These recombinant viruses are then used to infect Spodoptera frugiperda cells 
in which the inserted gene is expressed (see, e.g., Smith, et al, J. Virol 46: 584-93 (1983); U.S. 
Patent No. 4,745,051). 

In mammalian host cells, a number of viral-based expression systems may be utilized. In 

15 cases where an adenovirus is used as an expression vector, the gene coding sequence of interest may 
be ligated to an adenovirus transcription/translation control complex, e.g., the late promoter and 
tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., region El 
or E3) will result in a recombinant virus that is viable and capable of expressing gene protein in 

20 infected hosts, (e.g., see Logan et al., Proc. Natl. Acad. ScL USA, 81:3655-59 (1984)). Specific 
initiation signals may also be required for efficient translation of inserted gene coding sequences. 
These signals include the ATG initiation codon and adjacent sequences. In cases where an entire 
gene, including its own initiation codon and adjacent sequences, is inserted into the appropriate 
expression vector, no additional translational control signals may be needed. However, in cases where 

25 only a portion of the gene coding sequence is inserted, exogenous translational control signals, 

including, perhaps, the ATG initiation codon, must be provided. Furthermore, the initiation codon 
must be in phase with the reading frame of the desired coding sequence to ensure translation of the 
entire insert. These exogenous translational control signals and initiation codons can be of a variety of 
origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 

30 appropriate transcription enhancer elements, transcription terminators, etc. (see Bitter, et al., Methods 
in EnzymoL, 153:516^4 (1987)). 

In addition, a host cell strain may be chosen that modulates the expression of the inserted 
sequences, or modifies and processes the gene product in the specific fashion desired. Such 
modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein products may be 

35 important for the function of the protein. Different host cells have characteristic and specific 
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mechanisms for the post-translational processing and modification of proteins. Appropriate cell lines 
or host systems can be chosen to ensure the correct modification and processing of the foreign protein 
expressed. To this end, eukaryotic host cells that possess the cellular machinery for proper processing 
of the primary transcript, glycosylation, and phosphorylation of the gene product may be used. Such 
5 mammalian host cells include but are not limited to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 
3T3, WI38, etc. 

For long-term, high-yield production of recombinant proteins, stable expression is preferred. 
For example, cell lines that stably express the gene protein may be engineered. Rather than using 
expression vectors that contain viral origins of replication, host cells can be transformed with DNA 

10 controlled by appropriate expression control elements (e.g., promoter, enhancer, sequences, transcrip- 
tion terminators, polyadenylation sites, etc.), and a selectable marker. Following the introduction of 
the foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, and 
then are switched to a selective media. The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells that stably integrate the plasmid into their chromosomes 

15 and grow, to form foci, which in turn can be cloned and expanded into cell lines. This method may 
advantageously be used to engineer cell lines that express the gene protein. Such engineered cell lines 
may be particularly useful in screening and evaluation of compounds that affect the endogenous 
activity of the gene protein. 

In a preferred embodiment, timing and/or quantity of expression of the recombinant protein 

20 can be controlled using an inducible expression construct. Inducible constructs and systems for 

inducible expression of recombinant proteins will be well known to those skilled in the art. Examples 
of such inducible promoters or other gene regulatory elements include, but are not limited to, tetra- 
cycline, metallothionine, ecdysone, and other steroid-responsive promoters, rapamycin responsive 
promoters, and the like (No, et al, Proc. Natl. Acad Sci. USA, 93:3346-51 (1996); Furth, et al, Proc. 

25 Natl. Acad ScL USA, 91 :9302-6 (1994)). Additional control elements that can be used include 

promoters requiring specific transcription factors such as viral, particularly HIV, promoters. In one in 
embodiment, a Tet inducible gene expression system is utilized. (Gossen et al., Proc. Natl Acad Sci. 
USA, 89:5547-51 (1992); Gossen, etal., Science, 268:1766-69 (1995)). Tet Expression Systems are 
based on two regulatory elements derived from the tetracycline-resistance operon of the E. coli TnlO 

30 transposon — the tetracycline repressor protein (TetR) and the tetracycline operator sequence (tetO) to 
which TetR binds. Using such a system, expression of the recombinant protein is placed under the 
control of the tetO operator sequence and transfected or transformed into a host cell. In the presence 
of TetR, which is co-transfected into the host cell, expression of the recombinant protein is repressed 
due to binding of the TetR protein to the tetO regulatory element. High-level, regulated gene 

35 expression can then be induced in response to varying concentrations of tetracycline (Tc) or Tc 
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derivatives such as doxycycline (Dox), which compete with tetO elements for binding to TetR. Con- 
structs and materials for tet inducible gene expression are available commercially from CLONTECH 
Laboratories, Inc., Palo Alto, CA. 

When used as a component in an assay system, the gene protein may be labeled, either 
5 directly or indirectly, to facilitate detection of a complex formed between the gene protein and a test 
substance. Any of a variety of suitable labeling systems may be used including but not limited to 
radioisotopes such as 125 I; enzyme labeling systems that generate a detectable calorimetric signal or 
light when exposed to substrate; and fluorescent labels. Where recombinant DNA technology is used 
to produce the gene protein for such assay systems, it may be advantageous to engineer fusion 

10 proteins that can facilitate labeling, immobilization and/or detection. 

Indirect labeling involves the use of a protein, such as a labeled antibody, which specifically 
binds to the gene product. Such antibodies include but are not limited to polyclonal, monoclonal, 
chimeric, single chain, Fab fragments and fragments produced by a Fab expression library. 
Production of Antibodies 

15 Described herein are methods for the production of antibodies capable of specifically 

recognizing one or more epitopes. Such antibodies may include, but are not limited to polyclonal 
antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, 
Fab fragments, F(ab')2 fragments, fragments produced by a Fab expression library, anti-idiotypic (anti- 
Id) antibodies, and epitope-binding fragments of any of the above. Such antibodies may be used, for 

20 example, in the detection of a GPCR gene in a biological sample, or, alternatively, as a method for the 
inhibition of abnormal GPCR gene activity. Thus, such antibodies may be utilized as part of disease 
treatment methods, and/or may be used as part of diagnostic techniques whereby patients may be 
tested for abnormal levels of GPCR gene proteins, or for the presence of abnormal forms of such 
proteins. 

25 For the production of antibodies, various host animals may be immunize d by injection with 

the GPCR gene, its expression product or a portion thereof. Such host animals may include but are not 
limited to rabbits, mice, rats, goats and chickens, to name but a few. Various adjuvants may be used 
to increase the immunological response, depending on the host species, including but not limited to 
Freund f s (complete and incomplete), mineral gels such as aluminum hydroxide, surface active 

30 substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette- 
Gueriri) and Corynebacterium parvwn. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the 
sera of animals immunized with an antigen, such as GPCR gene product, or an antigenic functional 

35 derivative thereof. For the production of polyclonal antibodies, host animals such as those described 
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above, may be immunized by injection with gene product supplemented with adjuvants as also 
described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a particular 
antigen, may be obtained by any technique that provides for the production of antibody molecules by 
5 continuous cell lines in culture. These include, but are not limited to the hybridoma technique of 
Kohler and Milstein, Nature, 256:495-7 (1975); and U.S. Patent No. 4,376,110), the human B-cell 
hybridoma technique (Kosbor, et al., Invnunology Today, 4:72 (1983); Cote, et al., Proc. Nad. Acad. 
Set USA, 80:2026-30 (1983)), and the EBV-hybridoma technique (Cole, et al, in Monoclonal 
Antibodies And Cancer Therapy, Alan R. Liss, Inc., New York, pp. 77-96 (1985)). Such antibodies 

10 may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The 
hybridoma producing the mAb of this invention may be cultivated in vitro or in vivo. Production of 
high titers of mAbs in vivo makes this the presently preferred method of production. 

In addition, techniques developed for the production of "chimeric antibodies" (Morrison, et 
al, Proc. Natl. Acad. ScL, 81:6851-6855 (1984); Takeda, et al, Nature, 314:452-54 (1985)) by 

15 splicing the genes from a mouse antibody molecule of appropriate antigen specificity together with 
genes from a human antibody molecule of appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are derived from different animal species, such as 
those having a variable region derived from a murine mAb and a human immunoglobulin constant 
region. 

20 Alternatively, techniques described for the production of single chain antibodies (U.S. Patent 

No. 4,946,778; Bird, Science 242:423-26 (1988); Huston, et al., Proc. Natl Acad. ScL USA, 85:5879- 
83 (1988); and Wand, et al., Nature, 334:544^6 (1989)) can be adapted to produce gene-single chain 
antibodies. Single chain antibodies are typically formed by linking the heavy and light chain 
fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide. 

25 Antibody fragments that recognize specific epitopes may be generated by known techniques. 

For example, such fragments include but are not limited to: the F(ab r ) 2 fragments that can be produced 
by pepsin digestion of the antibody molecule and the Fab fragments that can be generated by reducing 
the disulfide bridges of the F(ab , )2 fragments. Alternatively, Fab expression libraries may be 
constructed (Huse, et al., Science, 246:1275-81 (1989)) to allow rapid and easy identification of 

30 monoclonal Fab fragments with the desired specificity. 
Screening Methods 

The present invention may be employed in a process for screening for agents such as agonists, 
ie. agents that bind to and activate GPCR polypeptides, or antagonists, i.e. inhibit the activity or 
interaction of GPCR polypeptides with its ligand. Thus, polypeptides of the invention may also be 
35 used to assess the binding of small molecule substrates and ligands in, for example, cells, cell-free 
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preparations, chemical libraries, and natural product mixtures as known in the art. Any methods 
routinely used to identify and screen for agents that can modulate receptors may be used in accor- 
dance with the present invention. 

The present invention provides methods for identifying and screening for agents that 
5 modulate GPCR expression or function. More particularly, cells that contain and express GPCR gene 
sequences may be used to screen for therapeutic agents. Such cells may include non-recombinant 
monocyte cell lines, such as U937 (ATCC# CRL-1593), THP-1 (ATCC# TB-202), and P388D1 
(ATCC# TIB-63); endothelial cells such as HUVECs and bovine aortic endothelial cells (BAECs); as 
well as generic mammalian cell lines such as HeLa cells and COS cells, e.g., COS-7 (ATCC# CRL- 

10 1651). Further, such cells may include recombinant, transgenic cell lines. For example, the trans- 
genic mice of the invention may be used to generate cell lines, containing one or more cell types 
involved in a disease, that can be used as cell culture models for that disorder. While cells, tissues, 
and primary cultures derived from the disease transgenic animals of the invention may be utilized, the 
generation of continuous cell lines is preferred. For examples of techniques that may be used to 

15 derive a continuous cell line from the transgenic animals, see Small, et al, MoL Cell Biol, 5:642^8 
(1985). 

GPCR gene sequences may be introduced into, and overexpressed in, the genome of the cell 
of interest. In order to overexpress a GPCR gene sequence, the coding portion of the GPCR gene 
sequence may be ligated to a regulatory sequence that is capable of driving gene expression in the cell 

20 type of interest. Such regulatory regions will be well known to those of skill in the art, and may be 
utilized in the absence of undue experimentation. GPCR gene sequences may also be disrupted or 
underexpressed. Cells having GPCR gene disruptions or underexpressed GPCR gene sequences may 
be used, for example, to screen for agents capable of affecting alternative pathways that compensate 
for any loss of function attributable to the disruption or underexpression. 

25 In vitro systems may be designed to identify compounds capable of binding the GPCR gene 

products. Such compounds may include, but are not limited to, peptides made of D-and/or L- 
configuration amino acids (in, for example, the form of random peptide libraries; (see e.g., Lam, etal., 
Nature, 354:82-4 (1991)), phosphopeptides (in, for example, the form of random or partially degene- 
rate, directed phosphopeptide libraries; see, e.g., Songyang, etal, Cell, 72:767-78 (1993)), antibodies, 

30 and small organic or inorganic molecules. Compounds identified may be useful, for example, in 

modulating the activity of GPCR gene proteins, preferably mutant GPCR gene proteins; elaborating 
the biological function of the GPCR gene protein; or screening for compounds that disrupt normal 
GPCR gene interactions or themselves disrupt such interactions. 

The principle of the assays used to identify compounds that bind to the GPCR gene protein 

35 involves preparing a reaction mixture of the GPCR gene protein and the test compound under 
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conditions and for a time sufficient to allow the two components to interact and bind, thus forming a 
complex that can be removed and/or detected in the reaction mixture. These assays can be conducted 
in a variety of ways. For example, one method to conduct such an assay would involve anchoring the 
GPCR gene protein or the test substance onto a solid phase and detecting target protein/test substance 

' 5 complexes anchored on the solid phase at the end of the reaction. In one embodiment of such a 

method, the GPCR gene protein may be anchored onto a solid surface, and the test compound, which 
is not anchored, may be labeled, either directly or indirectly. 

In practice, microtitre plates are conveniently utilized. The anchored component may be 
immobilized by non-covalent or covalent attachments. Non-covalent attachment may be accom- 

10 plished simply by coating the solid surface with a solution of the protein and drying. Alternatively, an 
immobilized antibody, preferably a monoclonal antibody, specific for the protein may be used to 
anchor the protein to the solid surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the nonimmobilized component is added to the coated surface 
containing the anchored component. After the reaction is complete, unreacted components are 

15 removed (e.g. , by washing) under conditions such that any complexes formed will remain immobi- 
lized on the solid surface. The detection of complexes anchored on the solid surface can be accom- 
plished in a number of ways. Where the previously nonimmobilized component is pre-labeled, the 
detection of label immobilized on the surface indicates that complexes were formed. Where the 
previously nonimmobilized component is not pre-labeled, an indirect label can be used to detect 

20 complexes anchored on the surface; e.g. , using a labeled antibody specific for the previously 

nonimmobilized component (the antibody, in turn, may be directly labeled or indirectly labeled with a 
labeled anti-Ig antibody). 

Alternatively, a reaction can be conducted in a liquid phase, the reaction products separated 
from unreacted components, and complexes detected; e.g., using an immobilized antibody specific for 

25 GPCR gene product or the test compound to anchor any complexes formed in solution, and a labeled 
antibody specific for the other component of the possible complex to detect anchored complexes. 

Compounds that are shown to bind to a particular GPCR gene product through one of the 
methods described above can be further tested for their ability to elicit a biochemical response from 
the GPCR gene protein. Agonists, antagonists and/or inhibitors of the expression product can be 

30 identified utilizing assays well known in the art. 
Antisense. Ribozvmes. and Antibodies 

Other agents that may be used as therapeutics include the GPCR gene, its expression 
product(s) and functional fragments thereof. Additionally, agents that reduce or inhibit mutant GPCR 
gene activity may be used to ameliorate disease symptoms. Such agents include antisense, ribozyme, 



WO 02/03793 PCT/US01/21923 

-32- 

and triple helix molecules. Techniques for the production and use of such molecules are well known 
to those of skill in the art. 

Anti-sense RNA and DNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. With respect to antisense DNA, 
5 oligodeoxyribonucleotides derived from the translation initiation site, e.g., between the -10 and +10 
regions of the GPCR gene nucleotide sequence of interest, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by an endonucleolytic cleavage. The composition 

10 of ribozyme molecules must include one or more sequences complementary to the GPCR gene 

mRNA, and must include the well known catalytic sequence responsible for mRNA cleavage. For this 
sequence, see U.S. Patent No. 5,093,246, which is incorporated by reference herein in its entirety. As 
such within the scope of the invention are engineered hammerhead motif ribozyme molecules that 
specifically and efficiently catalyze endonucleolytic cleavage of RNA sequences encoding GPCR 

15 gene proteins. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the molecule of interest for ribozyme cleavage sites that include the following sequences, 
GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the GPCR gene containing the cleavage site may be evaluated for 
20 predicted structural features, such as secondary structure, that may render the oligonucleotide 

sequence unsuitable. The suitability of candidate sequences may also be evaluated by testing their 
accessibility to hybridization with complementary oligonucleotides, using ribonuclease protection 
assays. 

Nucleic acid molecules to be used in triple helix formation for the inhibition of transcription 
25 should be single stranded and composed of deoxyribonucleotides. The base composition of these 

oligonucleotides must be designed to promote triple helix formation via Hoogsteen base pairing rules, 
which generally require sizeable stretches of either purines or pyrimidines to be present on one strand 
of a duplex. Nucleotide sequences may be pyrimidine-based, which will result in TAT and CGC 
triplets across the three associated strands of the resulting triple helix. The pyrimidine-rich molecules 
30 provide base complementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to that strand. In addition, nucleic acid molecules may be chosen that are purine-rich, for 
example, containing a stretch of G residues. These molecules will form a triple helix with a DNA 
duplex that is rich in GC pairs, in which the majority of the purine residues are located on a single 
strand of the targeted duplex, resulting in GGC triplets across the three strands in the triplex. 
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Alternatively, the potential sequences that can be targeted for triple helix formation may be 
increased by creating a so called "switchback" nucleic acid molecule. Switchback molecules are 
synthesized in an alternating 5-3', 3-5' manner, such that they base pair with first one strand of a 
duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
5 pyrimidines to be present on one strand of a duplex. 

It is possible that the antisense, ribozyme, and/or triple helix molecules described herein may 
reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by both normal and mutant GPCR gene alleles. In order to ensure that substantially normal 
levels of GPCR gene activity are maintained, nucleic acid molecules that encode and express GPCR 
10 gene polypeptides exhibiting normal activity may be introduced into cells that do not contain 

sequences susceptible to whatever antisense, ribozyme, or triple helix treatments are being utilized. 
Alternatively, it may be preferable to coadminister normal GPCR gene protein into the cell or tissue in 
order to maintain the requisite level of cellular or tissue GPCR gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may be 
15 prepared by any method known in the art for the synthesis of DNA and RNA molecules. These 

include techniques for chemically synthesizing oligodeoxyribonucleotides and oligoribonucleotides 
well known in the art such as for example solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated into a 
20 wide variety of vectors that incorporate suitable RNA polymerase promoters such as the T7 or SP6 
polymerase promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell lines. 

Various well-known modifications to the DNA molecules may be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include but are not limited to the 
25 addition of flanking sequences of ribonucleotides or deoxyribonucleotides to the 5' and/or 3' ends of 
the molecule or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages 
within the oligodeoxyribonucleotide backbone. 

Antibodies that are both specific for GPCR gene protein, and in particular, mutant gene 
protein, and interfere with its activity may be used to inhibit mutant GPCR gene function. Such 
30 antibodies may be generated against the proteins themselves or against peptides corresponding to 

portions of the proteins using standard techniques known in the art and as also described herein. Such 
antibodies include but are not limited to polyclonal, monoclonal, Fab fragments, single chain 
antibodies, chimeric antibodies, etc. 

In instances where the GPCR gene protein is intracellular and whole antibodies are used, 
35 internalizing antibodies may be preferred. However, lipofectin liposomes may be used to deliver the 
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antibody or a fragment of the Fab region that binds to the GPCR gene epitope into cells. Where 
fragments of the antibody are used, the smallest inhibitory fragment that binds to the target or 
expanded target protein's binding domain is preferred. For example, peptides having an amino acid 
sequence corresponding to the domain of the variable region of the antibody that binds to the GPCR 
5 gene protein may be used. Such peptides may be synthesized chemically or produced via recombinant 
DNA technology using methods well known in the art (see, e.g., Creighton, Proteins: Structures and 
Molecular Principles (1984) W.H. Freeman, New York 1983, supra; and Sambrook, et cd. y 1989, 
supra). Alternatively, single chain neutralizing antibodies that bind to intracellular GPCR gene 
epitopes may also be administered. Such single chain antibodies may be administered, for example, 

10 by expressing nucleotide sequences encoding single-chain antibodies within the target cell population 
by utilizing, for example, techniques such as those described in Marasco, et al, Proc. Natl Acad. ScL 
USA, 90:7889-93 (1993). 

RNA sequences encoding GPCR gene protein may be directly administered to a patient 
exhibiting disease symptoms, at a concentration sufficient to produce a level of GPCR gene protein 

15 such that disease symptoms are ameliorated. Patients may be treated by gene replacement therapy. 
One or more copies of a normal GPCR gene, or a portion of the gene that directs the production of a 
normal GPCR gene protein with GPCR gene function, may be inserted into cells using vectors that 
include, but are not limited to adenovirus, adeno-associated virus, and retrovirus vectors, in addition 
to other particles that introduce DNA into cells, such as liposomes. Additionally, techniques such as 

20 those described above may be utilized for the introduction of normal GPCR gene sequences into 
human cells. 

Cells, preferably, autologous cells, containing normal GPCR gene expressing gene sequences 
may then be introduced or reintroduced into the patient at positions that allow for the amelioration of 
disease symptoms. 

25 Pharmaceutical Compositions. Effective Dosages, and Routes of Administration 

The identified compounds that inhibit target mutant gene expression, synthesis and/or activity 
can be administered to a patient at therapeutically effective doses to treat or ameliorate the disease. A 
therapeutically effective dose refers to that amount of the compound sufficient to result in ameliora- 
tion of symptoms of the disease. 

30 Toxicity and therapeutic efficacy of such compounds can be determined by standard pharma- 

ceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the dose 
lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the popula- 
tion). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD 5Q fED 50 . Compounds that exhibit large therapeutic indices are preferred. 

35 While compounds that exhibit toxic side effects may be used, care should be taken to design a 
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delivery system that targets such compounds to the site of affected tissue in order to minimize 
potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in formulating a 
range of dosage for use in humans. The dosage of such compounds lies preferably within a range of 
5 circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For 
any compound used in the method of the invention, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IQo (i.e., the concentration of the test compound that 

10 achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can 
be used to more accurately determine useful doses in humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

Pharmaceutical compositions for use in accordance with the present invention may be 
formulated in conventional manner using one or more physiologically acceptable carriers or 

15 excipients. Thus, the compounds and their physiologically acceptable salts and solvates may be 

formulated for administration by inhalation or insufflation (either through the mouth or the nose) or 
oral, buccal, parenteral, topical, subcutaneous, intraperitoneal, intravenous, intrapleural, intraoccular, 
intraarterial, or rectal administration. It is also contemplated that pharmaceutical compositions may 
be administered with other products that potentiate the activity of the compound and optionally, may 

20 include other therapeutic ingredients. 

For oral administration, the pharmaceutical compositions may take the form of, for example, 
tablets or capsules prepared by conventional means with pharmaceutically acceptable excipients such 
as binding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl methyl- 
cellulose); fillers (e.g., lactose, microcrystailine cellulose or calcium hydrogen phosphate); lubricants 

25 (e.g., magnesium stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch glycolate); 
or wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods well known in 
the art Liquid preparations for oral administration may take the form of, for example, solutions, 
syrups or suspensions, or they may be presented as a dry product for constitution with water or other 
suitable vehicle before use. Such liquid preparations may be prepared by conventional means with 

30 pharmaceutically acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose 

derivatives or hydrogenated edible fats); emulsifying agents (e.g., lecithin or acacia); non-aqueous 
vehicles (e.g., almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives 
(e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as appropriate. 
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Preparations for oral administration may be suitably formulated to give controlled release of 
the active compound. 

For buccal administration the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present invention 
are convenientiy delivered in the form of an aerosol spray presentation from pressurized packs or a 
nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol 
the dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of e.g. gelatin for use in an inhaler or insufflator may be formulated containing a powder 
mix of the compound and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by bolus 
injection or continuous infusion. Formulations for injection may be presented in unit dosage form, 
e.g., in ampoules or in multi-dose containers, with an added preservative. The compositions may take 
such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain 
formulatory agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the active 
ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-free 
water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. Oral ingestion is possibly the easiest method of taking any medication. Such a route of 
administration, is generally simple and straightforward and is frequently the least inconvenient or 
unpleasant route of administration from the patient's point of view. However, this involves passing 
the material through the stomach, which is a hostile environment for many materials, including 
proteins and other biologically active compositions. As the acidic, hydrolytic and proteolytic 
environment of the stomach has evolved efficiently to digest proteinaceous materials into amino acids 
and oligopeptides for subsequent anabolism, it is hardly surprising that very little or any of a wide 
variety of biologically active proteinaceous material, if simply taken orally, would survive its passage 
through the stomach to be taken up by the body in the small intestine. The result, is that many 
proteinaceous medicaments must be taken in through another method, such as parenterally, often by 
subcutaneous, intramuscular or intravenous injection. 

Pharmaceutical compositions may also include various buffers (e.g., Tris, acetate, phosphate), 
solubilizers (e.g. 9 Tween, Polysorbate), carriers such as human serum albumin, preservatives 
(thimerosol, benzyl alcohol) and antioxidants such as ascorbic acid in order to stabilize 
pharmaceutical activity. The stabilizing agent may be a detergent, such as tween-20, tween-80, NPAO 
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or Triton X-100. EBP may also be incorporated into particulate preparations of polymeric compounds 
for controlled delivery to a patient over an extended period of time. A more extensive survey of 
components in pharmaceutical compositions is found in Remington's Pharmaceutical Sciences, 18th 
ed., A. R. Gennaro, ed., Mack Publishing, Easton, Pa. (1990). 
5 In addition to the formulations described previously, the compounds may also be formulated 

as a depot preparation. Such long acting formulations may be administered by implantation (for 
example subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the 
compounds may be formulated with suitable polymeric or hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, 

10 as a sparingly soluble salt. 

The compositions may, if desired, be presented in a pack or dispenser device that may contain 
one or more unit dosage forms containing the active ingredient The pack may for example comprise 
metal or plastic foil, such as a blister pack, The pack or dispenser device may be accompanied by 
instructions for administration. 

15 Diagnostics 

A variety of methods may be employed to diagnose disease conditions associated with the 
GPCR gene. Specifically, reagents may be used, for example, for the detection of the presence of 
GPCR gene mutations, or the detection of either over or under expression of GPCR gene mRNA. 

According to the diagnostic and prognostic method of the present invention, alteration of the 

20 wild-type GPCR gene locus is detected. In addition, the method can be performed by detecting the 
wild-type GPCR gene locus and confirming the lack of a predisposition or neoplasia. "Alteration of a 
wild-type gene" encompasses all forms of mutations including deletions, insertions and point muta- 
tions in the coding and noncoding regions. Deletions may be of the entire gene or only a portion of 
the gene. Point mutations may result in stop codons, frameshift mutations or amino acid substitutions. 

25 Somatic mutations are those that occur only in certain tissues, e.g., in tumor tissue, and are not 
inherited in the germline. Germline mutations can be found in any of a body's tissues and are 
inherited. If only a single allele is somatically mutated, an early neoplastic state may be indicated. 
However, if both alleles are mutated, then a late neoplastic state may be indicated. The finding of 
gene mutations thus provides both diagnostic and prognostic information. A GPCR gene allele that is 

30 not deleted (e.g., that found on the sister chromosome to a chromosome carrying a GPCR gene 

deletion) can be screened for other mutations, such as insertions, small deletions, and point mutations. 
Mutations found in tumor tissues may be linked to decreased expression of the GPCR gene product. 
However, mutations leading to non-functional gene products may also be linked to a cancerous state. 
Point mutational events may occur in regulatory regions, such as in the promoter of the gene, leading 

35 to loss or diminution of expression of the mRNA. Point mutations may also abolish proper RNA 
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processing, leading to loss of expression of the GPCR gene product, or a decrease in mRNA stability 
or translation efficiency. 

One test available for detecting mutations in a candidate locus is to directly compare genomic 
target sequences from cancer patients with those from a control population. Alternatively, one could 
5 sequence messenger RNA after amplification, e.g., by PCR, thereby eliminating the necessity of 
determining the exon structure of the candidate gene. Mutations from cancer patients falling outside 
the coding region of the GPCR gene can be detected by examining the non-coding regions, such as 
introns and regulatory sequences near or within the GPCR gene. An early indication that mutations in 
noncoding regions are important may come from Northern blot experiments that reveal messenger 

10 RNA molecules of abnormal size or abundance in cancer patients as compared to control individuals. 

The methods described herein may be performed, for example, by utilizing pre-packaged 
diagnostic kits comprising at least one specific gene nucleic acid or anti-gene antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings, to diagnose patients 
exhibiting disease symptoms or at risk for developing disease. 

15 Any cell type or tissue in which the gene is expressed may be utilized in the diagnostics 

described below. 

DNA or RNA from the cell type or tissue to be analyzed may easily be isolated using 
procedures that are well known to those in the art. Diagnostic procedures may also be performed in 
situ directly upon tissue sections (fixed and/or frozen) of patient tissue obtained from biopsies or 

20 resections, such that no nucleic acid purification is necessary. Nucleic acid reagents may be used as 
probes and/or primers for such in situ procedures (see, for example, Nuovo, PCR In Situ 
Hybridization:Protocols and Applications, Raven Press, N.Y. (1992)). 

Gene nucleotide sequences, either RNA or DNA, may, for example, be used in hybridization 
or amplification assays of biological samples to detect disease-related gene structures and expression. 

25 Such assays may include, but are not limited to, Southern or Northern analyses, restriction fragment 
length polymorphism assays, single stranded conformational polymorphism analyses, in situ hybridi- 
zation assays, and polymerase chain reaction analyses. Such analyses may reveal both quantitative 
aspects of the expression pattern of the gene, and qualitative aspects of the gene expression and/or 
gene composition. That is, such aspects may include, for example, point mutations, insertions, 

30 deletions, chromosomal rearrangements, and/or activation or inactivation of gene expression. 

Preferred diagnostic methods for the detection of gene-specific nucleic acid molecules may 
involve for example, contacting and incubating nucleic acids, derived from the cell type or tissue 
being analyzed, with one or more labeled nucleic acid reagents under conditions favorable for the 
specific annealing of these reagents to their complementary sequences within the nucleic acid 

35 molecule of interest Preferably, the lengths of these nucleic acid reagents are at least 9 to 30 
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nucleotides. After incubation, all non-annealed nucleic acids are removed from the nucleic 
acid:fingerprint molecule hybrid. The presence of nucleic acids from the fingerprint tissue that have 
hybridized, if any such molecules exist, is then detected. Using such a detection scheme, the nucleic 
acid from the tissue or cell type of interest may be immobilized, for example, to a solid support such 
5 as a membrane, or a plastic surface such as that on a microtitre plate or polystyrene beads. In this 
case, after incubation, non-annealed, labeled nucleic acid reagents are easily removed. Detection of 
the remaining, annealed, labeled nucleic acid reagents is accomplished using standard techniques 
well-known to those in the art. 

Alternative diagnostic methods for the detection of gene-specific nucleic acid molecules may 

10 involve their amplification, e.g., by PCR (the experimental embodiment set forth in Mullis U.S. Patent 
No. 4,683,202 (1987)), ligase chain reaction (Barany, Proc. Natl Acad. Sci. USA, 88:189-93 (1991)), 
self sustained sequence replication (Guatelli, et al, Proc. Natl Acad. Sci. USA, 87:1874-78 (1990)), 
transcriptional amplification system (Kwoh, et dL % Proc. Natl. Acad. ScL USA, 86:1 173-77 (1989)), 
Q-Beta Replicase (Lizardi et al , Bio/Technology, 6: 1 197 (1988)), or any other nucleic acid 

15 amplification method, followed by the detection of the amplified molecules using techniques well 
known to those of skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

In one embodiment of such a detection scheme, a cDNA molecule is obtained from an RNA 
molecule of interest (e.g., by reverse transcription of the RNA molecule into cDNA). Cell types or 

20 tissues from which such RNA may be isolated include any tissue in which wild type fingerprint gene 
is known to be expressed. For example, cell types useful in the detection of the melanocortin-3 
receptor gene include, but are not limited to, spermatogenic cells of the seminiferous tubules in the 
testes; cell types useful in the detection of the5-HT2-C receptor gene include, but are not limited to, 
testis and eye; cell types useful in the detection of the chemokine receptor 9A gene include, but are 

25 not limited to, testis, tongue and esophagus, lung, spleen, thymus, lymph nodes, bone marrow, 

salivary gland, skeletal muscle and small intestine; cell types useful in the detection of the glucocorti- 
coid-induced receptor gene include, but are not limited to, to brain, pharynx, testis and prostate. A 
sequence within the cDNA is then used as the template for a nucleic acid amplification reaction, such 
as a PCR amplification reaction, or the like. The nucleic acid reagents used as synthesis initiation 

30 reagents (e.g. , primers) in the reverse transcription and nucleic acid amplification steps of this method 
may be chosen from among the gene nucleic acid reagents described herein. The preferred lengths of 
such nucleic acid reagents are at least 15-30 nucleotides. For detection of the amplified product, the 
nucleic acid amplification may be performed using radioactively or non-radioactively labeled 
nucleotides. Alternatively, enough amplified product may be made such that the product may be 
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visualized by standard ethidium bromide staining or by utilizing any other suitable nucleic acid 
staining method. 

Antibodies directed against wild type or mutant gene peptides may also be used as disease 
diagnostics and prognostics. Such diagnostic methods, may be used to detect abnormalities in the 
5 level of gene protein expression, or abnormalities in the structure and/or tissue, cellular, or subcellular 
location of fingerprint gene protein. Structural differences may include, for example, differences in 
the size, electronegativity, or antigenicity of the mutant fingerprint gene protein relative to the normal 
fingerprint gene protein. 

Protein from the tissue or cell type to be analyzed may easily be detected or isolated using 

10 techniques that are well known to those of skill in the art, including but not limited to western blot 
analysis. For a detailed explanation of methods for carrying out western blot analysis, see Sambrook, 
et al (1989) supra, at Chapter 18. The protein detection and isolation methods employed herein may 
also be such as those described in Harlow and Lane, for example, (Antibodies: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1988)). 

15 Preferred diagnostic methods for the detection of wild type or mutant gene peptide molecules 

may involve, for example, immunoassays wherein fingerprint gene peptides are detected by their 
interaction with an anti-fingerprint gene-specific peptide antibody. 

For example, antibodies, or fragments of antibodies useful in the present invention may be 
used to quantitatively or qualitatively detect the presence of wild type or mutant gene peptides. This 

20 can be accomplished, for example, by immunofluorescence techniques employing a fluorescendy 
labeled antibody (see below) coupled with light microscopic, flow cytometric, or fluorimetric 
detection. Such techniques are especially preferred if the fingerprint gene peptides are expressed on 
the cell surface. 

The antibodies (or fragments thereof) useful in the present invention may, additionally, be 
25 employed histologically, as in immunofluorescence or immunoelectron microscopy, for in situ detec- 
tion of fingerprint gene peptides. In situ detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled antibody of the present invention. The 
antibody (or fragment) is preferably applied by overlaying the labeled antibody (or fragment) onto a 
biological sample. Through the use of such a procedure, it is possible to determine not only the 
30 presence of the fingerprint gene peptides, but also their distribution in the examined tissue. Using the 
present invention, those of ordinary skill will readily perceive that any of a wide variety of 
histological methods (such as staining procedures) can be modified in order to achieve such in situ 
detection. 

Immunoassays for wild type, mutant, or expanded fingerprint gene peptides typically 
35 comprise incubating a biological sample, such as a biological fluid, a tissue extract, freshly harvested 
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cells, or cells that have been incubated in tissue culture, in the presence of a detectably labeled 
antibody capable of identifying fingerprint gene peptides, and detecting the bound antibody by any of 
a number of techniques well known in the art. 

The biological sample may be brought in contact with and immobilized onto a solid phase 
5 support or carrier such as nitrocellulose, or other solid support that is capable of immobilizing cells, 
cell particles or soluble proteins. The support may then be washed with suitable buffers followed by 
treatment with the detectably labeled gene-specific antibody. The solid phase support may then be 
washed with the buffer a second time to remove unbound antibody. The amount of bound label on 
solid support may then be detected by conventional means. 

10 The terms "solid phase support or earner" are intended to encompass any support capable of 

binding an antigen or an antibody. Well-known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble to some 
extent or insoluble for the purposes of the present invention. The support material may have virtually 

15 any possible structural configuration so long as the coupled molecule is capable of binding to an 

antigen or antibody. Thus, the support configuration may be spherical, as in a bead, or cylindrical, as 
in the inside surface of a test tube, or the external surface of a rod. Alternatively, the surface may be 
flat such as a sheet, test strip, etc. Preferred supports include polystyrene beads. Those skilled in the 
art will know many other suitable carriers for binding antibody or antigen, or will be able to ascertain 

20 the same by use of routine experimentation. 

The binding activity of a given lot of anti-wild type or -mutant fingerprint gene peptide 
antibody may be determined according to well known methods. Those skilled in the art will be able 
to determine operative and optimal assay conditions for each determination by employing routine 
experimentation. 

25 One of the ways in which the gene peptide-specific antibody can be detectably labeled is by 

linking the same to an enzyme and using it in an enzyme immunoassay (EIA) (Voller, Ric Clin Lab, 
8:289-98 (1978) ["The Enzyme Linked Immunosorbent Assay (ELISA)", Diagnostic Horizons 2:1-7, 
1978, Microbiological Associates Quarterly Publication, Walkersville, Md.]; Voller, et al,J. Cliru 
Pathol, 31:507-20 (1978); Butler, Meth. EnzymoL, 73:482-523 (1981); Maggio (ed.), Enzyme 

30 Immunoassay, CRC Press, Boca Raton, Fla. (1980); Ishikawa, et al , (eds.) Enzyme Immunoassay, 
Igaku-Shoin, Tokyo (1981)). The enzyme that is bound to the antibody will react with an appropriate 
substrate, preferably a chromogenic substrate, in such a manner as to produce a chemical moiety that 
can be detected, for example, by spectrophotometric, fluorimetric or by visual means. Enzymes that 
can be used to detectably label the antibody include, but are not limited to, malate dehydrogenase, 

35 staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha- 
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glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline 
phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be 
accomplished by colorimetric methods that employ a chromogenic substrate for the enzyme. 
5 Detection may also be accomplished by visual comparison of the extent of enzymatic reaction of a 
substrate in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other immunoassays. For 
example, by radioactively labeling the antibodies or antibody fragments, it is possible to detect 
fingerprint gene wild type, mutant, or expanded peptides through the use of a radioimmunoassay 
10 (RIA) {see, e.g., Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on 

Radioligand Assay Techniques, The Endocrine Society, March, 1986). The radioactive isotope can be 
detected by such means as the use of a gamma counter or a scintillation counter or by 
autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the fluorescently 
15 labeled antibody is exposed to light of the proper wave length, its presence can then be detected due to 
fluorescence. Among the most commonly used fluorescent labeling compounds are fluorescein 
isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and 
fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting metals such as 152 Eu, 
20 or others of the lanthanide series. These metals can be attached to the antibody using such metal 
chelating groups as diethylenetriaminepentacetic acid (DTP A) or ethylenediamine-tetraacetic acid 
(EDTA). 

The antibody also can be detectably labeled by coupling it to a chemiluminescent compound. 
The presence of the chemiluminescent-tagged antibody is then determined by detecting the presence 
25 of luminescence that arises during the course of a chemical reaction. Examples of particularly useful 
chemiluminescent labeling compounds are luminol, isoluminol, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label the antibody of the present 
invention. Bioluminescence is a type of chemiluminescence found in biological systems in which a 
30 catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a 
bioluminescent protein is determined by detecting the presence of luminescence. Important 
bioluminescent compounds for purposes of labeling are luciferin, lucif erase and aequorin. 

Throughout this application, various publications, patents and published patent applications 
are referred to by an identifying citation. The disclosures of these publications, patents and published 
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patent specifications referenced in this application are hereby incorporated by reference into the 

present disclosure to more fully describe the state of the art to which this invention pertains. 

The following examples are intended only to illustrate the present invention and should in no 

way be construed as limiting the subject invention. 

5 Examples 

Example 1: Generation and Analysis of Mice Comprising Melanocortin-3 Receptor Gene 
Disruptions 

To investigate the role of melanocortin-3 receptors, disruptions in melanocortin-3 receptor 
10 genes were produced by homologous recombination. Specifically, transgenic mice comprising 

disruptions in melanocortin-3 receptor genes were created. More particularly, as shown in Figure 2, a 
melanocortin-3 receptor-specific targeting construct having the ability to disrupt or modify melano- 
cortin-3 receptor genes, specifically comprising SEQ ID NO: 1 was created using as the targeting arms 
(homologous sequences) in the construct, the oligonucleotide sequences identified herein as SEQ ID 
15 NO:3orSEQEDNO:4. 

The targeting construct was introduced into ES cells derived from the 129/Sv-+P+Mgf-SU/J 
mouse substrain to generate chimeric mice. The Fl mice were generated by breeding with C57BL/6 
females, and the F2 homozygous mutant mice were produced by intercrossing Fl heterozygous males 
and females. 

20 The transgenic mice comprising disruptions in melanocortin-3 receptor genes were analyzed 

for phenotypic changes and expression patterns. The phenotypes associated with a disruption in 
nuclear receptor genes were determined The homozygous mice demonstrated at least one of the 
following phetiotypes: 
Kidney: 

25 Two homozygous mutant mice (83793, 83779) had unilateral renal agenesis, with only one 

kidney present at necropsy. One heterozygous mouse (83778) also had unilateral renal agenesis, 
which is occasionally seen in this strain of mice. Therefore, the presence of only one kidney in 
homozygous mutants is a possible phenotypic change, although it may be due to spontaneous disease 
or of nonspecific etiology. 

30 Expression*. 

Tissues of the transgenic animals were analyzed for expression of the target gene. Organs 
from one heterozygous male and one heterozygous female were frozen, sectioned (10 /mi), stained 
and analyzed for lacZ expression using X-Gal as a substrate for beta-galactosidase. Nuclear Fast Red 
was used for counters taining. 
35 Organs and tissues collected and frozen: brain, sciatic nerve, eye, Harderian glands, thymus, 

spleen, lymph nodes, bone marrow, aorta, heart, lung, liver, gallbladder, pancreas, kidney, urinary 
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bladder, trachea, larynx, esophagus, thyroid gland, pituitary gland, adrenal glands, salivary glands, 
stomach, small and large intestines, tongue, skeletal muscle, skin and reproductive system. 

In addition, the brain of the heterozygous female was analyzed for lacZ expression as 
wholemount. The dissected brain was cut longitudinally, fixed and stained using X-Gal as a substrate 
5 for beta-galactosidase. To stop the reaction the brain was washed in PBS and fixed in PBS-buffered 
formaldehyde. 

Wild type control tissues were stained for X-gal to reveal background or signals due to 
endogenous beta-galactosidase activity. The following tissues show staining in the wild-type control 
sections and are therefore not suitable for X-gal staining: small and large intestines, stomach, vas 
10 deferens and epididymis. It has been previously reported that these organs contain high levels of 
endogenous beta-galactosidase activity. 

Example 2: Generation and Analysis of Mice Comprising 5-HT2-B Gene Disruptions 

To investigate the role of 5-HT2-B, disruptions in 5-HT2-B genes were produced by 
15 homologous recombination. Specifically, transgenic mice comprising disruptions in 5-HT2-B genes 
were created. More particularly, as shown in Figure 4, an 5-HT2-B -specific targeting construct 
having the ability to disrupt or modify 5-HT2-B genes, specifically comprising SEQ ID NO:5 was 
created using as the targeting arms (homologous sequences) in the construct, the oligonucleotide 
sequences identified herein as SEQ ID NO:7 or SEQ ID NO:8. 
20 The targeting construct was introduced into ES cells derived from the 129/Sv-+P+Mgf-SU/J 

mouse substrain to generate chimeric mice. The Fl mice were generated by breeding with C57BL/6 
females, and the F2 homozygous mutant mice were produced by intercrossing Fl heterozygous males 
and females. 

The transgenic mice comprising disruptions in 5-HT2-B genes were analyzed for phenotypic 
25 changes and expression patterns. The phenotypes associated with a disruption in nuclear receptor 
genes were determined. The homozygous mice demonstrated at least one of the following 
phenotypes: 

Embryonic Lethality. Homozygous mutant embryos die between E8.5 and E9.5 

Embryos were isolated at 8.5 to 12.5 days post coitum. Homozygous offspring were detected 
30 by PCR at E8.5, but not at later stages. At E8.5 homozygous were abnormal and retarded in 
development Further, resorptions were found at E8 .5 and all later stages examined. Where 
genotyping was feasible on partial resorptions, some scored as heterozygotes possibly due to maternal 
cell infiltration. These data suggest that death is occurring at multiple stages of development 

Embryos were isolated at E8.5 to E12.5. Six litters were examined comprising of 47 embryos, 
35 resorptions and partial resorptions, of which 36 were successfully genotyped These results are 
summarized below in Table 1. 
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TABLE 1. 5-HT2-B: Embryonic Development 



Litter 


Embryonic stage 


+/+ 


+/- 


-/- 


complete 
resorption/unknown 


1 


E8.5 (0-8 somites) 


0 


3 


0 


4 


2 


E8.5 (0-9 somites) 


4 


2 


2 


2 


3 


E8.5 (0-9 somites) 


6 


2 


0 


0 


4 


E9.5 


2 


6 


0 


2 


5 


E12.5 


3 


1 


0 


4 


6 


E12.5 


3 


2 


0 


2 



Expression: 

Tissues of the transgenic animals (heterozygotes) were analyzed for expression of the target gene. 
5 Organs from one heterozygous male and one heterozygous female were frozen, sectioned (10 pirn), 
stained and analyzed for lacZ expression using X-Gal as a substrate for beta-galactosidase. Nuclear 
Fast Red was used for counterstaining. 

Organs and tissues collected and frozen: brain, sciatic nerve, eye, Harderian glands, thymus, 
spleen, lymph nodes, bone marrow, aorta, heart, lung, liver, gallbladder, pancreas, kidney, urinary 
10 bladder, trachea, larynx, esophagus, thyroid gland, pituitary gland, adrenal glands, salivary glands, 
stomach, small and large intestines, tongue, skeletal muscle, skin and reproductive system. 

In addition, the brain of the heterozygous female was analyzed for lacZ expression as 
wholemount. The dissected brain was cut longitudinally, fixed and stained using X-Gal as a substrate 
for beta-galactosidase. To stop the reaction the brain was washed in PBS and fixed in PBS-buffered 
15 formaldehyde. 

Wild type control tissues were stained for X-gal to reveal background or signals due to 
endogenous beta-galactosidase activity. The following tissues show staining in the wild-type control 
sections and are therefore not suitable for X-gal staining: small and large intestines, stomach, vas 
deferens and epididymis. It has been previously reported that these organs contain high levels of 

20 endogenous beta-galactosidase activity. 
The results were as follows: 

LacZ (beta-galactosidase) expression was detectable in testis and eye. 
Male Reproductive System (Testis): A weak, punctuated staining pattern was observable in the 
epithelium of seminiferous tubules. 

25 Eye: A medium, punctuated staining pattern was detectable in the retina. 

Example 3: Generation and Analysis of Mice Comprising Chemokine Receptor 9 A Gene 
Disruptions 

To investigate the role of chemokine receptor 9 As, disruptions in chemokine receptor 9A 
genes were produced by homologous recombination. Specifically, transgenic mice comprising 
30 disruptions in chemokine receptor 9A genes were created. More particularly, as shown in Figure 6, a 
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chemokine receptor 9A~specific targeting construct having the ability to disrupt or modify chemokine 
receptor 9A genes, specifically comprising SEQ ID NO:9 was created using as the targeting arms 
(homologous sequences) in the construct, the oligonucleotide sequences identified herein as SEQ ID 
NO:llorSEQIDNO:12. 

5 The targeting construct was introduced into ES cells derived from the 129/Sv-+P+Mgf-SLJ/J 

mouse substrain to generate chimeric mice. The Fl mice were generated by breeding with C57BL/6 
females, and the F2 homozygous mutant mice were produced by intercrossing Fl heterozygous males 
and females. 

The transgenic mice comprising disruptions in chemokine receptor 9 A genes were analyzed 
10 for phenotypic changes and expression patterns. The phenotypes associated with a disruption in 
nuclear receptor genes were determined. The homozygous mice demonstrated at least one of the 
following phenotypes: 
Expression. 

Total RNA was isolated from the organs or tissues from adult C57B1/6 wild type mice. RNA 
15 was DNasel treated, and reverse transcribed using random primers. The resulting cDNA was checked 
for the absence of genomic contamination using primers specific to non-transcribed genomic mouse 
DNA. cDNAs were balanced for concentration using HPRT primers. RNA transcripts were detectable 
in lung, spleen, thymus, lymph nodes, bone marrow, salivary gland, skeletal muscle and small 
intestine. 

20 Tissues of the transgenic animals were analyzed for expression of the target gene. Organs 

from one heterozygous male and one heterozygous female were frozen, sectioned (10 fim), stained 
and analyzed for lacZ expression using X-Gal as a substrate for beta-galactosidase. Nuclear Fast Red 
was used for counterstaining. 

Organs and tissues collected and frozen: brain, sciatic nerve, eye, Harderian glands, thymus, 

25 spleen, lymph nodes, bone marrow, aorta, heart, lung, liver, gallbladder, pancreas, kidney, urinary 
bladder, trachea, larynx, esophagus, thyroid gland, pituitary gland, adrenal glands, salivary glands, 
stomach, small and large intestines, tongue, skeletal muscle, skin and reproductive system. 

In addition, the brain of the heterozygous female was analyzed for lacZ expression as 
wholemount. The dissected brain was cut longitudinally, fixed and stained using X-Gal as a substrate 

30 for beta-galactosidase. To stop the reaction the brain was washed in PBS and fixed in PBS-buffered 
formaldehyde. 

Wild type control tissues were stained for X-gal to reveal background or signals due to 
endogenous beta-galactosidase activity. The following tissues show staining in the wild-type control 
sections and are therefore not suitable for X-gal staining: small and large intestines, stomach, vas 
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deferens and epididymis. It has been previously reported that these organs contain high levels of 
endogenous beta-galactosidase activity. 

The results were as follows: LacZ (beta-galactosidase) expression was detectable in testis, 
tongue and esophagus. 

5 Male reproductive system: Many nuclei in seminiferous tubules of the testis stained weakly. 
Esophagus: Strong X-Gal signals were present in several cells in the mucous layer. 
Tongue: Epithelial cells of the mucous glands stained moderately. 
Behavior. 

For behavioral studies, homozygous mice were produced as follows: 
10 The targeting construct described above was introduced into ES cells derived from the 

129/SvEv mouse substrain to generate chimeric mice. F1N0 mice were generated by breeding with 
C57BL/6 females. F2N0 homozygous mutant mice were produced by intercrossing Fl heterozygous 
males and females. F1N0 heterozygotes were backcrossed to C57BL/6 mice to generate F1N1 
heterozygotes. F2N1 homozygous mice were produced by intercrossing F1N1 heterozygous males 
15 and females. 

The homozygous mice demonstrated the following behavioral phenotypes: 
When compared to age- and gender-matched wild-type control mice, homozygous mutant 
mice exhibited decreased agility, coordination, or balance as characterized by decreased performance 
on an accelerating rotarod. Mutants fell from the accelerating rotarod at lower speeds across all three 
20 trials. By the third trial there was a statistically significant difference between the mutants and wild 
type mice, as shown below in Table 2. 



TABLE 2, Chemokine Receptor 9A - Rotarod Test Fall Speed 



Gene 


Family 


SubFamity 


Gene Name 


Genotype 


F 

Generation 


N 

Generation 


Trial 


Value 


Std. 
Err. 


Count 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


+/+ 


2 


0 


Trial 
1 


5.96 


0.79 


12 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


+/+ 


2 


0 


Trial 
2 


7.48 


1.07 


12 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


+/+ 


2 


0 


Trial 
3 


7.57 


0.93 


12 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


+/+ 


2 


1 


Trial 
1 


6.62 


0.42 


13 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


+/+ 


2 


1 


Trial 
2 


8.50 


0.97 


13 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


+/+ 


2 


1 


Trial 
3 


11.42 


0.66 


13 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


-/- 


2 


0 


Trial 
1 


6.17 


0.33 


10 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


-/- 


2 


0 


Trial 
2 


6.38 


0.54 


10 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 




2 


0 


Trial 
3 


6.49 


0.70 


10 
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Gene 


Family 


SubFamlty 


Gene Name 


Genotype 


F 

Generation 


N 

Generation 


Trial 


Value 


Std. 
err. 


Count 


365 


GPCR i 


Chemokine 


chemokine 
receptor 9A 


-/- 


2 


1 


Trial 
1 


5.70 


0.68 


10 


365 


GPCR : 


Chemokine 


chemokine 
receptor 9A 


-/- 


2 


1 


Trial 
2 


7.13 


0.82 


10 


365 


GPCR 


Chemokine 


chemokine 
receptor 9A 


-/- 


2 


1 


Trial 
3 


8.31 


1.11 


10 



Example 4: Generation and Analysis of Mice Comprising Glucocorticoid-Induced Receptor 
Gene Disruptions 

To investigate the role of glucocorticoid-induced receptors, disruptions in glucocorticoid- 
5 induced receptor genes were produced by homologous recombination. Specifically, transgenic mice 
comprising disruptions in glucocorticoid-induced receptor genes were created. More particularly, as 
shown in Figure 8, a glucocorticoid-induced receptor-specific targeting construct having the ability to 
disrupt or modify glucocorticoid-induced receptor genes, specifically comprising SEQ ID NO: 13 was 
created using as the targeting arms (homologous sequences) in the construct, the oligonucleotide 
10 sequences identified herein as SEQ ID NO:15 or SEQ ID NO:16. 

The targeting construct was introduced into ES cells derived from the 129/Sv-+P+Mgf-SLJ/J 
mouse substrain to generate chimeric mice. The Fl mice were generated by breeding with C57BL/6 
females, and the F2 homozygous mutant mice were produced by intercrossing Fl heterozygous males 
and females. 

15 • The transgenic mice comprising disruptions in glucocorticoid-induced receptor genes were 

analyzed for phenotypic changes and expression patterns. The phenotypes associated with a 
disruption in nuclear receptor genes were determined. The homozygous mice demonstrated at least 
one of the following phenotypes: 
Expression. 

20 Tissues of the transgenic animals were analyzed for expression of the target gene. Organs 

from one heterozygous male and one heterozygous female were frozen, sectioned (10 /un), stained 
and analyzed for lacZ expression using X-Gal as a substrate for beta-galactosidase. Nuclear Fast Red 
was used for counterstaining. 

Organs and tissues collected and frozen: brain, sciatic nerve, eye, Harderian glands, thymus, 

25 spleen, lymph nodes, bone marrow, aorta, heart, lung, liver, gallbladder, pancreas, kidney, urinary 
bladder, trachea, larynx, esophagus, thyroid gland, pituitary gland, adrenal glands, salivary glands, 
stomach, small and large intestines, tongue, skeletal muscle, skin and reproductive system 

In addition, the brain of the heterozygous female was analyzed for lacZ expression as 
wholemount The dissected brain was cut longitudinally, fixed and stained using X-Gal as a substrate 
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for beta-galactosidase. To stop the reaction the brain was washed in PBS and fixed in PBS-buffered 
formaldehyde. 

Wild type control tissues were stained for X-gal to reveal background or signals due to 

endogenous beta-galactosidase activity. The following tissues show staining in the wild-type control 
5 sections and are therefore not suitable for X-gal staining: small and large intestines, stomach, vas 

deferens and epididymis. It has been previously reported that these organs contain high levels of 

endogenous beta-galactosidase activity. 

The results were as follows: 

LacZ expression was detectable in brain, pharynx, testis and prostate. 
10 Brain: In wholemount stained brains strong signals were observed in the cerebrum in thalamus, and 

surrounding areas including the caudate putamen. Faint lacZ expression was apparent along the 

cerebellar lobes of the cerebellum. On frozen sections of the heterozygous male few cells stained 

positive in the caudate putamen, but no lacZ signals were detectable in the cerebellum. 

Testis: Weak staining was observed in few cells in the seminiferous tubules. 
15 Prostate: Few epithelial cells stained weakly. 

Pharynx: Few cells in the connective tissue showed weak lacZ signals. 

Behavior. 

For behavioral studies, homozygous mice were produced as follows: 
The targeting construct described above was introduced into ES cells derived from the 
20 129/SvEv mouse substrain to generate chimeric mice. F1N0 mice were generated by breeding with 
C57BL/6 females. F2N0 homozygous mutant mice were produced by intercrossing Fl heterozygous 
males and females. F1N0 heterozygotes were backcrossed to C57BL/6 mice to generate F1N1 
heterozygotes. F2N1 homozygous mice were produced by intercrossing F1N1 heterozygous males 
and females. 

25 The homozygous mice demonstrated the following behavioral phenotypes: 

When compared to age- and gender-matched wild-type control mice, homozygous mutant 
animals from the Nl generation showed a statistically significant increase in total distance traveled 
and the percent of time spent in the central region of the test chamber in the open field test, relative to 
Nl generation wild-type animals (Table 3). This suggests that the mutant mice may be more 

30 hyperactive and less anxious than their wild-type littermates. However, NO generation animals did not 
display any difference in these measurements. 

Table 3. Glucocorticoid-Induced Receptor: 
Open Field- Session Time in Central Region 



Genotype 


F 

Generation 


N 

Generation 


Gene 


Family 


SubFamffy 


Gene Name 


Value 


Std. Err. 


Count 






0 


359 


GPCR 


Orphan GPCR 


oliimrortirniH irvtiirpri 


38.30 


6.56 


10 
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Genotype 


F 

Generation 


N 

Generation 


Gene 


Family 


SubFamily 


Gene Name 


Value 


Std. Err. 


Count 














receptor 








+/+ 


2 


1 


359 


GPCR 


Orphan GPCR 


glucocorticoid induced 
receptor 


29.79 


3.94 


11 


-/- 


2 


0 


359 


GPCR 


Orphan GPCR 


glucocorticoid induced 
receptor 


39.50 


8.47 


10 




2 


1 


359 


GPCR 


Orphan GPCR 


glucocorticoid induced 
receptor 


45.40 


4.19 


10 



Homozygous mutant animals from the Nl generation also spent significantly less time immo- 
bile in the tail suspension test relative to wild-types (Table 4). This may indicate that mutant mice 
have less of a propensity towards behavioral despair or depression than their wild-type littermates 
5 (anti-depressive behavior phenotype). However, NO generation animals did not display any difference 
in this test. 

Table 4. Glucocorticoid-Induced Receptor: 
Tail Suspension - Total Time Immobile 



Genotype 


F 

Generation 


N 

Generation 


Gene 


Family 


SubFamily 


Gene Name 


Value 


Std. 
Err. 


Count 


+/+ 


2 


0 


359 


GPCR 


Orphan GPCR * 


glucocorticoid 
induced receptor 


142.18 


23.94 


10 


+/+ 


2 


1 


359 


GPCR 


Orphan GPCR 


glucocorticoid 
induced receptor 


173.05 


15.51 


11 


-/- 


2 


0 


359 


GPCR 


Orphan GPCR 


glucocorticoid 
induced receptor 


114.21 


28.99 


10 


-/■ 


2 


1 


359 


GPCR 


Orphan GPCR 


glucocorticoid 
induced receptor 


135.22 


8.36 


10 



The discrepancy in the results observed in the Open Field and Tail Suspension Tests between 
generations may reflect differences in the background strains used to generate the mice. 

As is apparent to one of skill in the art, various modifications of the above embodiments can 
be made without departing from the spirit and scope of this invention. These modifications and 
15 variations are within the scope of this invention. 
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We claim: 

1. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to a target gene selected from the group 
consisting of: a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a 
glucocorticoid-induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 
gene, an orphan GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial 
differentiation GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 
receptor gene; 

(b) a second polynucleotide sequence homologous to the target gene; and 

(c) a selectable marker. 

2. The targeting construct of claim 1, wherein the targeting construct further comprises a screening 
marker. 

3. A method of producing a targeting construct for a target gene selected from the group consisting 
of a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a glucocorticoid- 
induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, an orphan 
GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation GPCR 3 
(EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene, the method 
comprising: 

(a) obtaining a first polynucleotide sequence homologous to the target gene; 

(b) obtaining a second polynucleotide sequence homologous to the target gene; 

(c) providing a vector comprising a selectable marker; and 

(d) inserting the first and second sequences into the vector, to produce the targeting construct 

4. A method of producing a targeting construct for a target gene selected from the group consisting 
of: a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a glucocorticoid- 
induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, an orphan 
GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation GPCR 3 
(EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene, the method 
comprising: 

(a) providing a polynucleotide comprising a first sequence homologous to a first region of the 
target gene and a second sequence homologous to a second region of the target gene; and 

(b) inserting a positive selection marker between the first and second sequences to form the 
targeting construct 

5. A cell comprising a disruption in a target gene selected from the group consisting of: a 
melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a glucocorticoid-induced 
receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, an orphan GPR15 gene, 
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a beta chemokine receptor (E01) gene, an endothelial differentiation GPCR 3 (EDG3) gene, an 
ATP receptor P2Ulgene, and an adenosine 3 receptor gene. 

6. The cell of claim 5, wherein the cell is a murine cell. 

7. The cell of claim 6, wherein the murine cell is an embryonic stem cell. 

8. A non-human transgenic animal comprising a disruption in a target gene selected from the group 
consisting of: a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a 
glucocorticoid-induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, 
an orphan GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation 
GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene. 

9. A cell derived from the non-human transgenic animal of claim 8. 

10. A method of producing a transgenic mouse, the method comprising: 

(a) introducing the targeting construct of claim 1 into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse. 

1 1 . A method of identifying an agent that modulates the expression of a target gene selected from the 
group consisting of: a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a 
glucocorticoid-induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, 
an orphan GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation 
GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene, the method 
comprising: 

(a) providing a non-human transgenic animal comprising a disruption in the target gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the expression of the disrupted target gene in the non-human 
transgenic animal is modulated. 

12. A method of identifying an agent that modulates the function of a target gene selected from the 
group consisting of a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a 
glucocorticoid-induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, 
an orphan GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation 
GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene, the method 
comprising: 

(a) providing a non-human transgenic animal comprising a disruption in the target gene; 

(b) administering an agent to the non-human transgenic animal; and 
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(c) determining whether the function of the disrupted target gene in the non-human 
transgenic animal is modulated. 

13. A method of identifying an agent that modulates the expression of a target gene selected from the 
group consisting of: a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a 
glucocorticoid-induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, 
an orphan GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation 
GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene, the method 
comprising: 

(a) providing a cell comprising a disruption in the target gene; 

(b) contacting the cell with an agent; and 

(c) determining whether expression of the target gene is modulated. 

14. A method of identifying an agent that modulates the function of a target gene selected from the 
group consisting of: a melanocortin-3 gene, a 5-HT-2B gene, a chemokine receptor 9A gene, a 
glucocorticoid-induced receptor gene, an orphan GPR10 (UHR-1) gene, an orphan GPR14 gene, 
an orphan GPR15 gene, a beta chemokine receptor (E01) gene, an endothelial differentiation 
GPCR 3 (EDG3) gene, an ATP receptor P2Ulgene, and an adenosine 3 receptor gene, the method 
comprising: 

(a) providing a cell comprising a disruption in the target gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the function of the target gene is modulated. 

15. The method of claim 13 or claim 14, wherein the cell is derived from the non-human transgenic 
animal of claim 8. 

16. An agent identified by the method of claim 1 1, claim 12, claim 13, or claim 14. 

17. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to a melanocortin-3 receptor gene; 

(b) a second polynucleotide sequence homologous to the melanocortin-3 receptor gene; and 

(c) a selectable marker. 

18. The targeting construct of claim 17, wherein the targeting construct further comprises a screening 
marker. 

19. A method of producing a targeting construct, the method comprising: 

(a) providing a first polynucleotide sequence homologous to a melanocortin-3 receptor gene; 

(b) providing a second polynucleotide sequence homologous to the melanocortin-3 receptor; 

(c) providing a selectable marker; and 



WO 02/03793 



PCT/US01/21923 



-54- 

(d) inserting the first sequence, second sequence, and selectable marker into a vector, to 
produce the targeting construct. 

20. A method of producing a targeting construct, the method comprising: 

(a) providing a polynucleotide comprising a first sequence homologous to a first region of a 
melanocortin-3 receptor gene and a second sequence homologous to a second region of a 
melanocortin-3 receptor gene; 

(b) inserting a positive selection marker in between the first and second sequences to form 
the targeting construct. 

21. A cell comprising a disruption in a melanocortin-3 receptor gene. 

22. The cell of claim 21, wherein the cell is a murine cell. 

23. The cell of claim 22, wherein the murine cell is an embryonic stem ceH. 

24. A non-human transgenic animal comprising a disruption in a melanocortin-3 receptor gene. 

25. A cell derived from the non-human transgenic animal of claim 24. 

26. A method of producing a transgenic mouse comprising a disruption in a melanocortin-3 receptor 
gene, the method comprising: 

(a) introducing the targeting construct of claim 17 into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse. 

27. A method of identifying an agent that modulates the expression of a melanocortin-3 receptor, the 
method comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a melanocortin-3 
receptor gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the expression of melanocortin-3 receptor in the non-human 
transgenic animal is modulated. 

28. A method of identifying an agent that modulates the function of a melanocortin-3 receptor, the 
method comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a melanocortin-3 
receptor gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the function of the disrupted melanocortin-3 receptor gene in the 
non-human transgenic animal is modulated. 
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29. A method of identifying an agent that modulates the expression of melanocortin-3 receptor, the 
method comprising: 

(a) providing a cell comprising a disruption in a melanocortin-3 receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether expression of the melanocortin-3 receptor is modulated. 

30. A method of identifying an agent that modulates the function of a melanocortin-3 receptor gene, 
the method comprising: 

(a) providing a cell comprising a disruption in a melanocortin-3 receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the function of the melanocortin-3 receptor gene is modulated. 

3 1 . The method of claim 29 or claim 30, wherein the cell is derived from the non-human transgenic 
animal of claim 24. 

32. An agent identified by the method of claim 27, claim 28, claim 29, or claim 30. 

33. A transgenic mouse comprising a disruption in a melanocortin-3 receptor gene, wherein the 
transgenic mouse exhibits at least one of the following phenotypes: a kidney abnormality or a 
behavioral abnormality. 

34. The transgenic mouse of claim 33, wherein the kidney abnormality is absence of one kidney. 

35. The transgenic mouse of claim 33, wherein the kidney abnormality is reduced size of the kidney 
relative to a wild-type mouse. 

36. The transgenic mouse of claim 33, wherein the kidney comprises unilateral renal agenesis. 

37. The transgenic mouse of claim 33, wherein the behavioral abnormality is passivity. 

38. The transgenic mouse of claim 33, wherein the behavioral abnormality is hypoactivity. 

39. The transgenic mouse of claim 33, wherein the behavioral abnormality is decreased locomotion. 

40. The transgenic mouse of claim 33, wherein the behavioral abnormality is a decrease in the attempt 
to escape while being examined relative to a wild type mouse. 

41 . The transgenic mouse of claim 33, wherein the behavioral abnormality is absence of any attempt 
to escape while being examined. 

42. The transgenic mouse of claim 33, wherein the behavioral abnormality is observed in males. 

43. A method of producing a transgenic mouse comprising a disruption in a melanocortin-3 receptor 
gene, wherein the transgenic mouse exhibits at least one of the following phenotypes: a kidney 
abnormality or a behavioral abnormality, the method comprising: 

(a) introducing a melanocortin-3 receptor gene targeting construct into a cell; 

(b) introducing the cell into a blastocyst; 
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(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a disruption in 
a melanocortin-3 receptor gene. 

44. A transgenic mouse produced by the method of claim 43. 

45. A cell derived from, the transgenic mouse of claim 33 or claim 44. 

46. A method of identifying an agent that ameliorates a phenotype associated with a disruption in a 
melanocortin-3 receptor gene, the method comprising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a melanocortin-3 
receptor gene; and 

(b) determining whether the agent ameliorates at least one of the following phenotypes: a 
kidney abnormality or a behavioral abnormality. 

47. A method of identifying an agent that modulates melanocortin-3 receptor expression, the method 
comprising: 

(a) administering an agent to the transgenic mouse comprising a disruption in a 
melanocortin-3 receptor gene; and 

(b) determining whether the agent modulates melanocortin-3 receptor expression in the 
transgenic mouse, wherein the agent has an effect on at least one of the following behaviors: 
passivity, locomotion or attempts to escape while being examined. 

48. A method of identifying an agent that modulates a behavior associated with a disruption in a 
melanocortin-3 receptor gene, the method comprising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a melanocortin-3 
receptor gene; and 

(b) determining whether the agent modulates passivity, locomotion or attempts to escape 
while being examined. 

49. A method of identifying an agent that modulates melanocortin-3 receptor gene function, the 
method comprising: 

(a) providing a cell comprising a disruption in a melanocortin-3 receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the agent modulates melanocortin-3 receptor gene function, wherein 
the agent modulates a phenotype associated with a disruption in a melanocortin-3 receptor 
gene. 

50. The method of claim 49, wherein the phenotype comprises at least one of the following: a kidney 
abnormality or a behavioral abnormality. 
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51. An agent identified by the method of claim 46, claim 47, claim 48, or claim 49. 

52. An agonist or antagonist of a melanocortin-3 receptor. 

53. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to a 5-HT-2B gene; 

(b) a second polynucleotide sequence homologous to the 5-HT-2B gene; and 
(a) a selectable marker. 

54. The targeting construct of claim 53, wherein the targeting construct further comprises a screening 
marker. 

55. A method of producing a targeting construct, the method comprising: 

(a) providing a first polynucleotide sequence homologous to a 5-HT-2B gene; 

(b) providing a second polynucleotide sequence homologous to the 5-HT-2B ; 

(c) providing a selectable marker; and 

(d) inserting the first sequence, second sequence, and selectable marker into a vector, to 
produce the targeting construct. 

56. A method of producing a targeting construct, the method comprising: 

(a) providing a polynucleotide comprising a first sequence homologous to a first region of a 
5-HT-2B gene and a second sequence homologous to a second region of a 5-HT-2B gene; 

(b) inserting a positive selection marker in between the first and second sequences to form 
the targeting construct. 

57. A cell comprising a disruption in a 5-HT-2B gene. 

58. The cell of claim 57, wherein the cell is a murine cell. 

59. The cell of claim 58, wherein the murine cell is an embryonic stem cell. 

60. A non-human transgenic animal comprising a disruption in a 5-HT-2B gene. 

61. A cell derived from the non-human transgenic animal of claim 60. 

62. A method of producing a transgenic mouse comprising a disruption in a 5-HT-2B gene, the 
method comprising: 

(a) introducing the targeting construct of claim 53 into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse. 

63. A method of identifying an agent that modulates the expression of a 5-HT-2B, the method 
comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a 5-HT-2B gene; 
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(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the expression of 5-HT-2B in the non-human transgenic animal is 
modulated. 

64. A method of identifying an agent that modulates the function of a 5-HT-2B, the method 
comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a 5-HT-2B gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the function of the disrupted 5-HT-2B gene in the non-human 
transgenic animal is modulated. 

65. A method of identifying an agent that modulates the expression of 5-HT-2B, the method 
comprising: 

i) providing a cell comprising a disruption in a 5-HT-2B gene; 

(b) contacting the cell with an agent; and 

(c) determining whether expression of the 5-HT-2B is modulated. 

66. A method of identifying an agent that modulates the function of a 5-HT-2B gene, the method 
comprising: 

(a) providing a cell comprising a disruption in a 5-HT-2B gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the function of the 5-HT-2B gene is modulated. 

67. The method of claim 65 or claim 66, wherein the cell is derived from the non-human transgenic 
animal of claim 60. 

68. An agent identified by the method of claim 63, claim 64, claim 65, or claim 66. 

69. A transgenic mouse comprising a disruption in a 5-HT-2B gene, wherein the transgenic mouse 
exhibits at least one of the following phenotypes: embryonic lethality, abnonnal embryos, 
retarded development, and reabsorbed embryos. 

70. The transgenic mouse of claim 69, wherein development is arrested at embryonic day 8.5. 

71. The transgenic mouse of claim 69, wherein homozygous offspring are undetectable after 
embryonic day E8.5. 

72. The transgenic mouse of claim 69, wherein homozygous embryos die between embryonic day 8.5 
and embryonic day 9.5. 

73. The transgenic mouse of claim 69, wherein the wherein the embryos are reabsorbed between 
embryonic day 8.5 and embryonic day 9.5. 
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74. A method of producing a transgenic mouse comprising a disruption in a 5-HT-2B gene, wherein 
the transgenic mouse exhibits at least one of the following phenotypes: embryonic lethality, 
abnormal embryos, retarded development, and reabsorbed embryos, the method comprising: 

(a) introducing a 5-HT-2B gene targeting construct into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a disruption in 
a 5-HT-2B gene. 

75. A transgenic mouse produced by the method of claim 74. 

76. A cell derived from the transgenic mouse of claim 60 or claim 75. 

77. A method of identifying an agent that ameliorates a phenotype associated with a disruption in a 5- 
HT-2B gene, the method comprising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a 5-HT-2B gene; 
and 

(b) determining whether the agent ameliorates at least one of the following phenotypes: 
embryonic lethality, abnormal embryos, retarded development, and reabsorbed embryos. 

78. An agonist or antagonist of a 5-HT-2B receptor. 

79. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to a chemokine receptor 9A gene; 

(b) a second polynucleotide sequence homologous to the chemokine receptor 9A gene; and 

(c) a selectable marker. 

80. The targeting construct of claim 79, wherein the targeting construct further comprises a screening 
marker. 

81. A method of producing a targeting construct, the method comprising: 

(a) providing a first polynucleotide sequence homologous to a chemokine receptor 9A gene; 

(b) providing a second polynucleotide sequence homologous to the chemokine receptor 9A; 

(c) providing a selectable marker, and 

(d) inserting the first sequence, second sequence, and selectable marker into a vector, to 
produce the targeting construct. 

82. A method of producing a targeting construct, the method comprising: 

(a) providing a polynucleotide comprising a first sequence homologous to a first region of a 
chemokine receptor 9 A gene and a second sequence homologous to a second region of a 
chemokine receptor 9A gene; 
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(b) inserting a positive selection marker in between the first and second sequences to form 
the targeting construct. 

83. A cell comprising a disruption in a chemokine receptor 9 A gene. 

84. The cell of claim 83, wherein the cell is a murine cell. 

85. The cell of claim 84, wherein the murine cell is an embryonic stem cell. 

86. A non-human transgenic animal comprising a disruption in a chemokine receptor 9A gene. 

87. A cell derived from the non-human transgenic animal of claim 86. 

88. A method of producing a transgenic mouse comprising a disruption in a chemokine receptor 9A 
gene, the method comprising: 

(a) introducing the targeting construct of claim 79 into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse. 

89. A method of identifying an agent that modulates the expression of a chemokine receptor 9A, the 
method comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a chemokine 
receptor 9A gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) detemrining whether the expression of chemokine receptor 9A in the non-human 
transgenic animal is modulated. 

90. A method of identifying an agent that modulates the function of a chemokine receptor 9 A, the 
method comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a chemokine 
receptor 9A gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the function of the disrupted chemokine receptor 9A gene in the 
non-human transgenic animal is modulated. 

91. A method of identifying an agent that modulates the expression of chemokine receptor 9 A, the 
method comprising: 

(a) providing a cell comprising a disruption in a chemokine receptor 9A gene; 

(b) contacting the cell with an agent; and 

(c) determining whether expression of the chemokine receptor 9A is modulated. 
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92. A method of identifying an agent that modulates the function of a chemokine receptor 9A gene, 
the method comprising: 

(a) providing a cell comprising a disruption in a chemokine receptor 9A gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the function of the chemokine receptor 9A gene is modulated. 

93. The method of claim 91 or claim 92, wherein the cell is derived from the non-human transgenic 
animal of claim 86. 

94. An agent identified by the method of claim 89, claim 90, claim 91, or claim 92. 

95. A transgenic mouse comprising a disruption in a chemokine receptor 9A gene, wherein the 
transgenic mouse exhibits at least one of the following phenotypes: decreased agility, 
coordination, or balance relative to a wild-type mouse. 

96. The transgenic mouse of claim 95, wherein decreased agility, coordination, or balance is 
characterized by decreased performance on an accelerating rotarod. 

97. The transgenic mouse of claim 95, wherein decreased agility, coordination, or balance is 
characterized by falling from an accelerating rotarod at lower speeds relative to a wild-type 
mouse. 

98. A method of producing a transgenic mouse comprising a disruption in a chemokine receptor 9A 
gene, wherein the transgenic mouse exhibits at least one of the following phenotypes: decreased 
agility, coordination, or balance relative to a wild-type mouse, the method comprising: 

(a) introducing a chemokine receptor 9A gene targeting construct into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a disruption in 
a chemokine receptor 9A gene. 

99. A transgenic mouse produced by the method of claim 98. 

100. A cell derived from the transgenic mouse of claim 95 or claim 98. 

101. A method of identifying an agent that ameliorates a phenotype associated with a disruption in a 
chemokine receptor 9A gene, the method comprising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a chemokine 
receptor 9A gene; and 

(b) detennining whether the agent ameliorates at least one of the following phenotypes: 
decreased agility, coordination, or balance relative to a wild-type mouse. 
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102. A method of identifying an agent that modulates chemokine receptor 9A expression, the method 
comprising: 

(a) administering an agent to the transgenic mouse comprising a disruption in a chemokine 
receptor 9 A gene; and 

(b) determining whether the agent modulates chemokine receptor 9A expression in the 
transgenic mouse, wherein the agent has an effect on at least one of the following behaviors: 
decreased agility, coordination, or balance relative to a wild-type mouse. 

103. A method of identifying an agent that modulates a behavior associated with a disruption in a 
chemokine receptor 9A gene, the method comprising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a chemokine 
receptor 9A gene; and 

(b) determining whether the agent modulates agility, coordination, or balance of the 
transgenic mouse. 

104. A method of identifying an agent that modulates chemokine receptor 9A gene function, the 
method comprising: 

(a) providing a cell comprising a disruption in a chemokine receptor 9A gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the agent modulates chemokine receptor 9A gene function, wherein 
the agent modulates a phenotype associated with a disruption in a chemokine receptor 9A 
gene. 

105. The method of claim 104, wherein the phenotype comprises at least one of the following: 
decreased agility, coordination, or balance relative to a wild-type mouse. 

106. An agent identified by the method of claim 101, claim 102, claim 103, or claim 104. 

107. An agonist or antagonist of a chemokine receptor 9A receptor. 

108. Phenotypic data associated with the transgenic mouse of claim 95 or claim 98, wherein the data 
is in a database. 

109. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to a glucocorticoid-induced receptor gene; 

(b) a second polynucleotide sequence homologous to the glucocorticoid-induced receptor 
gene; and 

(c) a selectable marker. 

110. The targeting construct of claim 111, wherein the targeting construct further comprises a 
screening marker. 

111. A method of producing a targeting construct, the method comprising: 
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(a) providing a first polynucleotide sequence homologous to a glucocorticoid-induced 
receptor gene; 

(b) providing a second polynucleotide sequence homologous to the glucocorticoid-induced 
receptor; 

(c) providing a selectable marker; and 

(d) inserting the first sequence, second sequence, and selectable marker into a vector, to 
produce the targeting construct. 

112. A method of producing a targeting construct, the method comprising: 

(a) providing a polynucleotide comprising a first sequence homologous to a first region of a 
glucocorticoid-induced receptor gene and a second sequence homologous to a second region 
of a glucocorticoid-induced receptor gene; 

(b) inserting a positive selection marker in between the first and second sequences to form 
the targeting construct. 

113. A cell comprising a disruption in a glucocorticoid-induced receptor gene. 

1 14. The cell of claim 113, wherein the cell is a murine cell. 

115. The cell of claim 1 14, wherein the murine cell is an embryonic stem cell. 

1 16. A non-human transgenic animal comprising a disruption in a glucocorticoid-induced receptor 
gene. 

1 17. A cell derived from the non-human transgenic animal of claim 116. 

118. A method of producing a transgenic mouse comprising a disruption in a glucocorticoid-induced 
receptor gene, the method comprising: 

(a) introducing the targeting construct of claim 111 into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse. 

119. A method of identifying an agent that modulates the expression of a glucocorticoid-induced 
receptor, the method comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a glucocorticoid- 
induced receptor gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the expression of glucocorticoid-induced receptor in the non-human 
transgenic animal is modulated. 
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120. A method of identifying an agent that modulates the function of a glucocorticoid-induced 
receptor, the method comprising: 

(a) providing a non-human transgenic animal comprising a disruption in a glucocorticoid- 
induced receptor gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the function of the disrupted glucocorticoid-induced receptor gene in 
the non-human transgenic animal is modulated. 

121 . A method of identifying an agent that modulates the expression of glucocorticoid-induced 
receptor, the method comprising: 

(a) providing a cell comprising a disruption in a glucocorticoid-induced receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether expression of the glucocorticoid-induced receptor is modulated. 

122. A method of identifying an agent that modulates the function of a glucocorticoid-induced 
receptor gene, the method comprising: 

(a) providing a cell comprising a disruption in a glucocorticoid-induced receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the function of the glucocorticoid-induced receptor gene is 
modulated. 

123. The method of claim 121 or claim 122, wherein the cell is derived from the non-human 
transgenic animal of claim 1 16. 

124. An agent identified by the method of claim 119, claim 120, claim 121 or claim 122. 

125. A transgenic mouse comprising a disruption in a glucocorticoid-induced receptor gene, wherein 
the transgenic mouse exhibits at least one of the following phenotypes: hyperactivity, reduced 
anxiety, decreased propensity toward behavioral despair, or decreased propensity toward 
depression. 

126. The transgenic mouse of claim 125, wherein hyperactivity is characterized by an increase in 
total distance traveled in an open field test, relative to a wild-type mouse. 

127. The transgenic mouse of claim 125, wherein hyperactivity is characterized by an increase in the 
percent of time spent in the central region of the test chamber in an open field test, relative to a 
wild-type mouse. 

128. The transgenic mouse of claim 125, wherein reduced anxiety is characterized by an increase in 
total distance traveled in an open field test, relative to a wild-type mouse. 
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129. The transgenic mouse of claim 125, wherein reduced anxiety is characterized by an increase in 
the percent of time spent in the central region of the test chamber in an open field test, relative 
to a wild-type mouse. 

130. The transgenic mouse of claim 125, wherein the decreased propensity toward behavioral despair 
is characterized by less time immobile in a tail suspension test relative to a wild-type mouse. 

131. The transgenic mouse of claim 125, decreased propensity toward depression is characterized by 
less time immobile in a tail suspension test relative to a wild-type mouse. 

132. A method of producing a transgenic mouse comprising a disruption in a glucocorticoid-induced 
receptor gene, wherein the transgenic mouse exhibits at least one of the following phenotypes: 
hyperactivity, reduced anxiety, a decreased propensity toward behavioral despair, or a 
decreased propensity toward depression, the method comprising: 

(a) introducing a glucocorticoid-induced receptor gene targeting construct into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting the resulting blastocyst into a pseudopregnant mouse, wherein said 
pseudopregnant mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a disruption in 
a glucocorticoid-induced receptor gene. 

133. A transgenic mouse produced by the method of claim 132. 

134. A cell derived from the transgenic mouse of claim 125 or claim 133. 

135. A method of identifying an agent that ameliorates a phenotype associated with a disruption in a 
glucocorticoid-induced receptor gene, the method comprising: 

(a) administering an agent to, a transgenic mouse comprising a disruption in a glucocorticoid- 
induced receptor gene; and 

(b) determining whether the agent ameliorates at least one of the following phenotypes: 
hyperactivity, reduced anxiety, decreased propensity toward behavioral despair, or decreased 
propensity toward depression. 

136. A method of identifying an agent that modulates glucocorticoid-induced receptor expression, 
the method comprising: 

(a) administering an agent to the transgenic mouse comprising a disruption in a 
glucocorticoid-induced receptor gene; and 

(b) determining whether the agent modulates glucocorticoid-induced receptor expression in 
the transgenic mouse, wherein the agent has an effect on at least one of the following 
behaviors: hyperactivity, anxiety, behavioral despair, or depression. 
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137. A method of identifying an agent that modulates a behavior associated with a disruption in a 
glucocorticoid-induced receptor gene, the method comprising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a glucocorticoid- 
induced receptor gene; and 

(b) determining whether the agent modulates hyperactivity, anxiety, behavioral despair, or 
depression. 

138. A method of identifying an agent that modulates glucocorticoid-induced receptor gene function, 
the method comprising: 

(a) providing a cell comprising a disruption in a glucocorticoid-induced receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the agent modulates glucocorticoid-induced receptor gene function, 
wherein the agent modulates a phenotype associated with a disruption in a glucocorticoid- 
induced receptor gene. 

139. The method of claim 138, wherein the phenotype comprises at least one of the following: 
hyperactivity, reduced anxiety, decreased propensity toward behavioral despair, or decreased 
propensity toward depression. 

140. An agent identified by the method of claim 135, claim 136, claim 137, or claim 138. 

141. A transgenic mouse comprising a disruption in a glucocorticoid-induced receptor gene, wherein 
the transgenic mouse exhibits hyperactivity, reduced anxiety, a decreased propensity toward 
behavioral despair, or a decreased propensity toward depression relative to a wild-type mouse. 

142. An agonist or antagonist of a glucocorticoid-induced receptor. 

143. Phenotypic data associated with the transgenic mouse of claim 125 or claim 133, wherein the 
data is in a database. 
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TCTAGACTGGACAGCATCCACAAGAGAAGCACCTAGAAGGAGAATTTTCCCCAGCAGCTTGCTCAGGACCCTGCA 
GGAGCCGCAGCTGGGACTGGACCTGCTGTTAACCATGAACTCTTCCTGCTGCCTGTCTTCTGTTTCTCCGATGCT 
GCCTAACCTCTCTGAGCACCCTGCAGCCCCTCCTGCCAGCAACCGGAGCGGCAGTGGGTTCTGTGAGCAGGTCTT 
CATCAAGCCGGAGGTCTTCCTGGCTCTGGGCATCGTCAGTCTGATGGAAAACATCCTGGTGATCCTGGCTGTGGT 
CAGGAATGGCAACCTGCACTCTCCCATGTACTTCTTCCTGTGCAGCCTGGCTGCAGCCGACATGCTGGTGAGCCT 
GTCCAACTCCCTGGAGACCATCATGATCGCCGTGATCAACAGCGACTCCCTGACCTTGGAGGACCAGTTTATCCA 
GCACATGGATAATATCTTCGACTCTATGATTTGCATCTC 

CATCGACAGGTACGTCACCATCTTCTATGCCCTTCGGTACCACAGCATCATGACAGTTAGGAAAGCCCTCACCTT 

GATCGGGGTCATCTGGGTCTGCTGCGGCATCTGCGGCGTGATGTTCATCATCTACTCCGAGAGCAAGATGGTCAT 

CGTGTGTCTCATCACCATGTTCTTCGCCATGGTGCTCCTCATGGGCACCCTATATATCCACATGTTCCTC 

CAGGCTCCACGTCCAGCGCATCGCAGTGCTGCCCCCTGCTGGCGTGGTGGCCCCACAGCAGCACTCCTGCATGAA 

GGGGGCTGTCACCATCACTATCCTGCTGGGTGTTTTCATCTTCTGCTGGGCGCCTTTCTT^ 

CATCATCACCTGCCCCACCAATCCCTACTGCATCTGCTACACGGCCCATTTCAACACCTACCTGGTTCTCATCAT 

GTGCAACTCCGTCATCGACCCCCTCATCTACGCCTTCCGCAGCCTGGAGCTGCGCAACACGTTCAAGGAGATTCT 

CTGCGGCTGCAACAGCATGAACTTGGGCTAGGATGCCCGTGGAGGTGTTCCACATCCAGCCAAGAGACAAA 

ACGCTCAGACGGGACGTAAAAGGGTGTTAGGAGCTGGAACTGTGCTO^ 

GCAGACGGGACACGGCGTAGGATGGGCTGTCTGTGAGGATCTGTGTGTGGGTAAGTCAGTTTGATCTAGCACATA 

GCCTGGAAGAATCAGGCAAAGCAGCCCTGAGTGTCATCTGTGTTCATTGCTAGGCACCCAGGGTTTGTC 

GCCTGCTTATTGGCTTTGTACCAGTAACTGTGCTTCAAGCCAACCAGACCGGAGGGCTCTCGTGAGCAGAAAGAG 

TGCTTAGACTTCCGGCAAGCATCCTGGCTCACAGCGGCCACCTCCTGACCACTACCGGGAGAGCTTTGCACATAT 

TCTGTGGGAGATTGAGTGAAGCCCTGAAAACAATGTGATATTTGCTGCTCCCTTCCAGAA 

GCCTCCCCGAACCCCTGCACAGAGACATGACCCCCTTCTC 

GTTTTGTTCGTTAAAATCTAAGCTT (SEQ ID NO:l) 



MSTSSCCLSSVSPMLPlSrLSEHPAAPPASlSrRSGSGFCEQWIKPEVFLAI^ 

FLCSLAAADMLVSLSNSLETIMIAVINSDSLTLEDQ 

RYHSI>m/RKALTLIGVIWCCGICGVMFIIYSESKMVIVCLI 

PAGWAPQQHSCMKGAVTITILLGVF I FCWAPFFLHLVL I ITC PTNPYC I C YTAHFNTYLVLIMCNSVIDPLI YA 
FRSLELRNTFKEILCGCNSMNLG (SEQ ID NO:2) 
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Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



TCTAGACTGGACAGCATCCACAAGAGAAGCACCTAGAAGGAGAATTTTCCCCAGC^GCTT 

GCTCAGGACCCTGCAGGAGCCGCAGCTGGGACTGGACCTGCTGTTAACCATGAACTCTTC 

CTGCTGCCTGTCTTCTGTTTCTCCGATGCTGCCTAACCTCTCTGAGCACCCTGCAGCCCC 

TCCTGCCAGCAACCGGAGCGKK1AGTGGGTTCTGTGAGCAGGTCTTCATCAAGCCGGAGGT 

CCTCCTGGCTCTGGGCATCGTCAGTCTGATGGAAAACATCCTGGTGATCCTGGCTGTGGT 

CAGGAATGGCAACCTGCACTCTCCCATGTACTTCTTCCTGTGCAGCCTGGCTGCAGCCGA 

CATGCTGGTGAGCCTGTCCAACTCCCTGGAGACCATCATGATCGCCGTGATCAACAGCGA 

CTCCCTGACCTTGGAGGACCAGTTTATCCAGCACATGGATAATATCTTCGACTCTATGAT 

TTGCATCTCCCTGGTGGCCTCCATCTGCAACCTCCTGGC CATTGCCATCGACAGGTACGT 

CACCATCTTCTATGCCCTTCGGTACCACAGCATCATGACAGTTAGGAAAGCCCT 

GATCGGGGTCATCTGGGTCTGCTGCGGCATCTGCGGCGTGATGTTCATCATCTACTCCGA 

GAGCAAGATGGTCATCGTGTGTCTCATCACCATGTTCTTCGCCATGGTGCTCCTCATGGG 

CACCCTATATATCCACATGTTCCTCTTCGCCAGGCTCCACGTCCAGCGCATCGCAGTGCT 

GCCCCCTGCTGGCGTGGTGGCCCCACAGCAGCACTCCTGCATGAAGGGGGCTGTCACCAT 

CACTATCCTGCTGGGTGTTTTCATCTTCTGCTGGGCGCCTyTTCTTCCTCCACCTGGTCCT 

CATCATCACCTGCCCCACCAATCCCTACTGCATCTGCTACACGG^ 

CCTGGTTCTCATCATGTGCAACTCCGTCATCGACCCCCTCATCTACGCCTTCCGCAGCCT 
GGAGCTGCGCAACACGTTCAAGGAGATTCTCTGCGGCTGCAACAGCATGAACTTGGGCTA 
GGATGCCCGTGGAGGTGTTCCACATCCAGCCAAGAGACAAAAACAACGCTCAGACGGGAC 
GTAAAAGGGTGTTAGGAGCTGGAACTGTGCTTGGCTTCGTCTGTAAGCTCGTGGCCCTTT 

GCAGACGGGACACGGCGTAGGATGGGCTGTCTGTGAGGATCTGTGTGTGGGTAAGTCAGT 

TTGATCTAGCACATAGCCTGGAAGAATCAGGCAAAGCAGCCCTGAGTGTCATCTGTGTTC 

ATTGCTAGGCACCCAGGGTTTGTGGCCCCTGCCTGCTTATTGGCTTTGTACCAGTAACTG 

TGC TTC AAGC CAAC CAGACC GG AGGGCTC TC GTGAGC AG AAAGAGTGC TT AG AC TTC CGG 

CAAGCATCCTGGCTCACAGCGGCCACCTCCTGACCACTACCGGGAGAGCTTTGCACATAT 

TCTGTGGGAGATTGAGTGAAGCCCTGAAAACAATGTGATATTTGCTGCTCCCTTC 

CTTACATCTGTGCCAGCCTCCCCGAACCCCTGCACAGAGACATGACCCCCTTCTCCCTGT 

GCCGTTGTCATGGTTGTTATTATTGTTGGAGTTTTC 
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Gene Sequence Structure * 
Size of CDS: 1675 bp 



Arm Length: 
5': 1.6 kb 
3': 4 kb 



296 bp 



Sequence Deleted 



Targeting Vector* 
(genomic sequence) 

Construct Number: 2787 



LacZ-Neo 



519 bp 



Targeting Vector 



• Not drawn to scale 



5 1 >ACCTGCTGTTAACCATGAACT 

CTTCCTGCTGCCTGTCTTCTGTTT 

CTCCGATGCTGCCTAACCTCTCTG 

AGCACCCTGCAGCCCCTCCTGCCA 

GCAACCGGAGCGGCAGTGGGTTCT 

GTGAGCAGGTCTTCATCAAGCCGG 

AGGTCTTCCTGGCTCTGGGCATCG 

TCAGTCTGATGGAAAACATCCTGG 

TGATCCTGGCT< 3 ' (SEQ ID 

NO:3) 



5 1 >CATTGCCATCGACAGGTACGT 
C AC C ATCTTC TATGC CCTTC GGTA 
CCACAGCATCATGACAGTGAGGAA 
AGC CCTCACCTTGATCGGGGTCAT 
CTGGGTCTGCTGCGGCATCTGCGG 
CGTGATGTTCATCATCTACTCCGA 
GAGCAAGATGGTCATCGTGTGTCT 
C ATCAC C ATGTTCTTCGC CATGGT 
GCTCCTCATGG<3 1 (SEQ ID 
NO:4) 



FIGURE 2B 

SUBSTITUTE SHEET (RULE 26) 
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ACTGTCTGGAACTGGACTGAGTCACCAAAAGGCGAATGGCTTCATCOT^ 
CTGAGCACATTTTACAGAAGACATGTGATCACCTGATC^ 

CAGAGGAAATGAAGCAGACTGTGGAGGGACAGGGGCATACAGTGCACTGGGCAGCTCTCCTGATACTCGCGGTGA 

TAATACCCACCATTGGTGGGAACATCCTTGTGATTCTGG 

ACTACTTTTTAATGTCCTTGGCGATAGCAGAl^ 

TCATGTTTGAGGCTATATGGCCCCTCCCACTGGCCCTGTGTCCTGCCTGGTTATTCCTCGATGTTCTCTTTTCAA 
CTGCCTCCATCATGCATCTCTGTGC 

AGTGCAACACCCGGGCTACTGCATTCATCAAGATTACAGTGGTATGGTTAATTTCAATAG© 

TCCCTATTAAAGGAATCGAGACTGATGTGATTAATCCACACAATGTCACCTGTGAGCTGACAAAGGACCGCTTTG 

GCAGTTTTATGGTCTTTGGGTCACTGGCTGCTTTCT^ 

CCATTCACACTTTACAGAAGAAAGCTTACTTGGTCAAAAATAAGCCACCTCAACGCCTAACACGGTGGACTGTGC 

CCACAGTTTTCCTAAGGGAAGACTCATCCTTTTCATCACCAGAAAAGGTGGCAATGCTGGATGGGTCTCACAGGG 

ATAAAATTCTACCTAACTCAAGTGATGAGACACTTATGCGAAGAATGTCCTCAGTTGGAAAAAGATCAGCCCAAA 

CCATTTCTAATGAGCAGAGAGCCTCGAAGGCCCTTGGAGTCGTGTTTTTCCTTTTTCTGCTT 

TTTTTATTACAAATCTAACTTTAGCTCTGTGTGATTCCTGCAATCAGACCACTCTCAAAACAC 

TTGTGTGGATAGGCTACGTTTCCTCGGGGGTGAATCCTCTGATCTATACACTCTTCAATAAGACATTTCGGGAA^ 

CATTTGGCAGGTACATCACCTGCAATTACCGAGCCACAAAGT 

TTTGTTTTGGGAATTCAATGGTAGAAAACTCTAAATTTTTCACAAA^ 

CCATGTACCAGAGCCCAATGAGGCTCCGATGTTCAACCATTCAGTCCTCATCAATCATCCTCCTCGATACCCTTC 
TCACTGAAAACGATGGCGACAAAGCGGAAGAGCAGGTCAGCTACATATTGCAGGAACGGGCCGGCCTCATCTTGA 
GAGAGGGTGATGAGCAGGACGCACGCGCACCATGGCAGGTTCAAGAGTGA (SEQ ID NO:5) 



MASSYKMSEQSTTSEHILQKTCDHLILTNRSGLETDSVAEEMKQTVEGQGHTVHWA^ 

LAVALEKRLQYATNYFLMSLAIADLLVGLFVMPI ALLTIMFEAIWPLPLALC PAWLFLDVLFSTAS IMHLC AI SL 

DRYIAIKKPIQANQCOTRATAFIKITWWLISIGI^ 

FVPLTIMVVTYFLTIHTLQKKAYLVKI^PPQRLTRWTVPTVFLREDSS 

MRRMSSVGKRSAQTISNEQRASKALGVVFFLFLLMWCP 

PLIYTLFNKTFREAFGRYITCNYRATKSVKALRKFSSTLCFGNSMV^ 

TIQS SS I ILLDTLLTENDGDKAEEQVS YI LQERAGL ILREGDEQDARAPWQVQE (SEQ ID NO:6) 
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Underlined = deleted in targeting construct 

Bold » sequence flanking Neo insert in targeting construct 



ACTGTCTGGAACTG6ACTGAGTCACCAAAAGGC6AAT6GCTTCATCTTATAAAATGTCT6 
AACAAAGCACAACTTCTGAGCACAMOTACAGAAG 

ACCGTTCTG GATTAGAGACAGACTCAGTAGCAGAGGAAATGAAGCAGACTGTGGAGGGAC 
AGGGGCATACAGTGCACTGGGCAGCTCTCCTGATACTCGCGGTGATAATACCCACCATTG 
GTGGGAACATCCTTGTGATTCTGGCTGTTGCACTGGAGAAAAGGCTGCAGTACGCTACCA 
ACTACTTTTTAATGTCCTTG GCGATAG(^GAT^ 

TTGCCCTCTTGACAATCATGTTTGAGGCTATATGGCCCCTCCCACTGGCCCTGTGTCCTG 
CCTGK5TTATTCCTCGATGTTCTCTTTTCAACTGCCTCCATCATGCATCTCTGTGCCATTT 
CCCTGGACCGCTATATAGCCATCAAAAAGCCAATTCAGGCCAATCAGTGCAACACCCGGG 
CTACTGCATTCATCAAGATTACAGTGGTATGGTTAATTTCAATAGGCATCGCCATCCCAG 
TCCCTATTAAAGGAATCGAGACTGATGTGATTAATCCACACAATGTCACCTGTGAGCTGA 
CAAAGGACCGCTTTGGCAGTTTTATGGTCTTT 

TCACCATCATGGTAGTCACTTACTTTCTCACCATTCACACTTTACAGAAGAAAGCTTACT 
TGGTCAAAAATAAGCCACCTCAACGCCTAACACGGTGGACTGTGCCCACAGTTTTCCTAA 
GGGAAGACTCATCCTTTTCATCACCAGAAAAGGTGGCAATGCTGGATGGGTCTCACAGGG 
ATAAAATTCTACCTAACTGAAGTGATGAGACACTTATGCGAAGAATGTCCTCAGTTGGAA 
AAAGATCAGCCCAAACCATTTCTAATGAGCAGAGAGCCTCGAAGGCCCTTGGAGTCGTGT 
TTTTCCTTTTTCTGCTTATGTGGTGCCCCTTTTTTATTACAAATC 

GTGATTCC TGCAATC AGAC C ACTC TC AAAAC AC TC CTGGAGATATTTGTGTGGATAGG CT 
ACGTTTCCTCGGGGGTGAATCCTCTGATCTATACACTCTTCAATAAGACATTTCGGGAAG 
CATTTGGCAGGTACATCACCTGCAATTACCGAGCCACAAAGTCAGTAAAAGCACTTAGGA 
AGTTTTC C AGTACACTTTGTTTTGGGAATTC AATGGTAGAAAACTC TAAATTTTTC AC AA 
AACATGGAATTCGAAATGGGATCAACCCTGCCATGTACCAGAGCCCAATGAGGCTCCGAT 
GTTCAACCATTCAGTCCTCATCAATCATCCTCCTCGATACCCTTCTCACTGAAAACGATG 
GCGACAAAGCGGAAGAGCAGGTCAGCTACATATTGCAGGAACGGGCCGGCCTCATCTTGA 
GAGAGGGTGATGAGCAGGACGCACGCGCACCATGGCAGGTTCAAGAGTGA 
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Gene Sequence Structure * 



130 bp 



Sequence Deleted 



319 bp 



Size of full-length 
cDNA: 1550 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 2520 



Arm Length: 
5': 1.6 kb 
3': 5 kb 



LacZ-Neo 



* Not drawn to scale 




5 1 >TGAGTGTCTGGTGGGTTTGCT 

AAATGCTTTGCTAAAGCAGATGAC 

TTGCTTAGCTACTGACCATGCTGA 

CCACTGTCTGGAACTGGACTGAGT 

C ACC AAAAGGCGAATGGC TTCATC 

TTATAAAATGTCTGAACAAAGCAC 

AACTTCTGAGCACATTTTACAGAA 

GACATGTGATCACCTGATCCTGAC 

TAACCGTTCTG<3 ' (SEQID 

NO:7) 



5 1 >GGC GATAGCAGATTTGCTGGT 

TGGATTGTTTGTGATGCCGATTGC 

CCTCTTGACAATCATGTTTGGTGA 

GTATTTCCCCTTGTTCCTGCCACT 

GAACACTACTAACGTAGTGAAATG 

GACACTCACTGACCTTTATTTTGT 

TTGAAATAAAAGAAGGACCTGGAT 

TAAAAACACAGAAGGGAACATTCC 

TTC ATTTTTC A< 3 1 

(SEQIDNO:8) 
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SUBSTITUTE SHEET (RULE 26) 
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aatattttccttgacctaatgccatcttgtgtccccttgcagagccctattcctaacatggctgatgactatggc 

tctgaatccacatcttccatggaagactacgttaacttcaacttcactgacttctactgtgagaaaaacaatgtc 

aggcagtttgcgagccatttcctcccacccttgtactggct^ 

gttatccttgtctactggtactgcacaagagtgaagaccatgaccgacatgttccttttg 

gacctcctctttcttgtcactcttcccttctgggccattgctgctgctgaccagtggaagttccagac 

tgcaaggtggtcaacagcatgtacaagatgaacttctacagctgtgtgttgctgatcatgtgcatcagcgtggac 

aggtacattgccattgcccaggccatgagagcacatacttggagggagaaaaggcttttgtacagcaaaatggtt 

tgctttaccatctgggtattggcagctgctctctgcatc 

ggcattgctatctgcaccatggtttaccctagcgatgagagcaccaaactgaagtcagctgtcttgaccc 

gtcattctggggttcttccttcccttcgtggtcatggcttgctgctataccatcatcattcacaccctgatacaa 

gccaagaagtcttccaagcacaaagccctaaaagtgaccatcactgtcctgaccgtctttgtcttgtctcagttt 

ccctacaactgcattttgttggtgcagaccattgacgcctatgccatgttcatctccaactgtgccgtttccacc 

aacattgacatctgcttccaggtcacccagaccatcgccttcttccacagttgcctgaaccctgttctctatgtt 

tttgtgggtgagagattccgccgggatctcgtgaaaaccctgaagaacttgggttgcatcagccaggcccag 

gtttcatttacaaggagagagggaagcttgaagctgtcgtctatgttgctggagacaacctcaggagcactc 

ctctgaggggtcttctctgaggtgcatggttcttttc 

tgggaccagagagagtgaaagagaaaagaaaactcagaaagggatgaatctgaactatatgattacttgtagtca 

gaatttgccaaagcaaatatttcaaaatcaactgactagtgcaggaggctgttgattggctcttgactgtgat^ 

ccgcaattctcaaaggaggactaaggaccggcactgtggagca^ 

ccgctgcctctggaggagcccttggattttctccatgcactgtgaacttctgtggct^ 

cttccaaaaggggacacagaagcactggctgctgctacagacc^ 

ctttgggaaattttctaccctgctcttgagcctgataacccat 

ttccaggcaatctcagacctaatttccttc 

ttttgaaacgatctgcaggtcttgccagtgaacccc^ 

tggcttccaatccatttctgtgtcctgctggaggttttaacctagacaaggattc 

gacagtgtctctccatggcctgagcagggagattataacagctgggttcgcaggagccagccttggccctgttgt 
aggcttgttctgttgagtggcacttgcttt 

GGCCC TCTTCTTTCTG AGGC C C ACTTTATTC TGAGGAATAC AGTGAGCAGATATGGGC AGC AGC C AGGTAGGGC A 
AAGGGGTGAAGCGCAGGCCTTGCTGGAAGGCTATTTACTTCCATGCTTCTCCTTT 
ATTTTAAAAGCTTTTAACTTAGAGATTAGGCTGAAAAAAATAAGTAATG 
TTTCTTATCATGATTTGGCAAAATGCATCACCTTTGAAAATATTTC 

ATATGAAGC ATTAATTAC TTGTC AC TTTCTTTAC C C TGTC TC AATATTTTAAGTGTGTGC AATTAAAGATC AAAT 

AGATACATTAAGAGTGTGAAGGCTGGTCTGAAGGTAGTGA 

ATTGAACTCCTTGTTCTACTTCCCTG£ 

AGCAATAGGGATAAGGAAATAAGATCT (SEQ ID NO:9) 



MADDYGSESTSSMEDYVNFNFTDFYCEKNNW 

IS^AIADLLFLWLPFWAIAAADQWKFQTFMCKVVNS^ 

YSKMVCFTIWVLAAALCIPEILYSQIK^^ 

HTLIQAKKS SKHKALKVTI TVLTVFVLSQFPYNC ILLVQT IDAYAMF I SNCAVSTNI DIC FQVTQTI AFFHSCLN 
PVLYVFVGERFRRDLVKTLKNLGCISQAQW (SEQ ID NO: 10) 
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Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



AATATTTTCCTTGACCTAAT6CCATCTTGTGTCCCCTTGCAGAGCCCTATTCCTAACATG 
GCTGATGACTATGGCTCTGAATCCACATCOTCCATGGAAGACTACGTTJ^CTTCAACTTC 
ACTGACTTCTACTGTGAGAAAAACAATGTCAGGCAGTTTGCGAGCCATTTCCTCCCACCC 
TTGTACTGGCTCGTGTTCATCGTGGGTGCCTTGGGCAACAGTCTTGTTA TCCTTGTCTAC 

TGGTACTGCACAAGAGTGAAGACCATGACCGACATGTTCCTTTTGAATTTGGCAATTGCT 

GACCTCCTCTTTCTTGTCACTCTTCCCTTCTGGGCCATTGCTGCTGCTGACCAGTGGAAG 

TTCCAGACCTTCATGTGCAAGGTGGTCAACAGCAT GTACAAGATGAACTTCTACAGCTGT 

GTGTTGCTGATCATGTGCATCAGCGTGGACAGGTACATTGCCATTGCCCAGGCCATGAGA 

GCACATACTTGGAGGGAGAAAAGGCTTTTGTACAGCAAAATGGTTTGCTTTACCATCT 

GTATTGGCAGCTGCTCTCTGCATCCCAGAAATCTTATACAGCCAAATCAAGGAGGAATCC 

GGCATTGCTATCTGCACCATGGTTTACCCTAGCGATGAGAGCACCAAACTGAAGTCAGCT 

GTCTTGACCCTGAAGGTCATTCTGGGGTTCTTCCTTCCCTTCGTGGTCATGGCTTGCTGC 

TATACCATCATCATTCACACCCTGATACAAGCCAAGAAGTCTTCCAAGCACAA^ 

AAAGTGACCATCACTGTCCTGACCGTCTTTGTCTTGTCTCAGTTTCCCTACAACTGCA^ 

TTGTTGGTGCAGACCATTGACGCCTATGCCATGTTCATCTCCAACTGTGCCGTTTCCACC 

AACATTGACATCTGCTTCCAGGTCACCCAGACCATCGCCTTCTTCCACAGTTGCCTGAAC 

CCTGTTCTCTATGTTTTTGTGGGTGAGAGATTCCGCCGGGATCTCGTGAAAACCCTGAAG 

AACTTGGGTTGCATCAGCCAGGCCCAGTGGGTTTCATTTACAAGGAGAGAGGGAAGCTTG 

AAGCTGTCGTCTATGTTGCTGGAGACAACCTCAGGAGCACTCTCCCTCTGAGGGGTCTTC 

TCTGAGGTGCATGGTTCTTTTGGAAGAAATGAGAAATACATGAAACAGTTTCCCCACTGA 

TGGGAC CAGAGAGAGTGAAAGAGAAAAGAAAAC TC AGAAAGGGATGAATCTGAACTATAT 

GATTACTTGTAGTCAGAATTTGCCAAAGCAAATATTTCAAAATCAACTGACTAGTGCAGG 

AGGCTGTTGATTGGCTCTTGACTGTGATGCCCGCAATTCTCAAAGGAGGACTAAGGACCG 

GCACTGTGGAGCACCCTGGCTTTGCCACTCGCCGGAGCATCAATGCCGCTGCCTCTGGAG 

GAGCCCTTGGATTTTCTCCATGCACTGTGAAC 

CTTC CAAAAGGGGAC AC AGAAGC AC TGGCTGC TGC TAC AG AC CGC AAAAGC AGAAAGTTT 
CGTGAAAATGTCCATCTTTGGGAAATTTTCTACCCTGCTCTTGAGCCTGATAACCCATGC 
CAGGTCTTATAGATTCCTGATCTAGAACCTTTCCAGGCAATCTCAGACCTAATTTCCTTC 
TGTTCTCCTTGTTCTGTTCTGGGCCAGTGAAGGTC^ 

C AGGTC TTGC C AGTGAAC C C C TGGACAAC TGAC CAC ACC C ACAAGGC ATC C AAAGTC T GT 
TGGC TTCCAATCC ATTTC TGTGTCC TGCTGGAGGTTTT AAC C TAGAC AAGGATTC CGCTT 
ATTCCTTGGTATGGTGACAGTGTCTCTCCATGGCCTGAGCAGGGAGATTATAACAGCTGG 
GTTCGCAGGAGCCAGCCTTGGCCCTGTTGTAGGCTTGTT 
GGGTCCACCGTCTGTCTGCTCCCTAGAAAATGGGCTGGTTCTT^ 

GAGGCCCACTTTATTCTGAGGAATACAGTGAGCAGATATGGGCAGCAGCCAGGTAGGGCA 

AAGGGGTGAAGCGCAGGCCTTGCTGGAAGGCTATTTACTTCCATGCTTCTC 

CTC TATAGTGGC AAC ATTTTAAAAGCTTTTAAC TTAGAGATTAGGCTGAAAAAAATAAGT 

AATGGAATTCACCTTTGCkTCTTTTC 

CCTTTGAAAATATTTCACATATTGGAAAAGTGCTTTTTAATGTGTATAT 
TACTTGTCACTTTCTTTACCCTGTCTCAATATTTTAAGTGTC 

AGATAC^TTAAGAGTGTGAAGGCTGGTCTGAAGGTAGTGAGCTATCTCAATCGGATTGTO 
CACACTCAGTTACAGATTGAACTCCTTGTTCTACTTCCCTGCTTCTCTCTACTGCAATTG 
ACTAGTCTTTAAAAAAAAGTGTGAAGAGTAAGCAATAGGGATAAGGAAATAAGATCT 



FIGURE 6A 



WO 02/03793 



9/33 



PCT/US01/21923 



Gene Sequence Structure * 231 bp 



Sequence Deleted 394 bp 



Size of CDS: 2577 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 3340 



Arm Length: 
5': 2.7 kb 
3':4kb 



LacZ-Neo 




* Not drawn to scale 



5 1 >TATTCCTTACAGAGCCTTATT 
CCTGGCATGTTTGATGACTTCAGC 
TATGACTC CACTGCTTCCACAGAT 
GACTACATGAATTTGAATTTCAGT 
AGCTTCTTCTGTAAGAAAAATAAT 
GTCAGGCAGTTTGCAAGCCATTTT 
CTCCCACCTCTGTACTGGCTTGTG 
TTCATTGTGGGCACCTTGGGCAAC 
AGCCTGGTCAT< 3 1 
(SEQ.ID.NO.il) 



5 1 >TGTAC AAGATGAAC TTC TACA 
GCTGTGTGC TTC TCATC ATGTGC A 
TCAGTGTGGACAGATACATTGCCA 
TTGTACAGGCCATGAAGGCTCAGG 
TC TGGAGGC AG AAAAGGC C GCTAT 
ACAGCAAGATGGTCTGCATTACCA 
TCTGGGTGATGGCAGCTGTGCTCT 
GCACCCCAGAAATCCTGTACAGTC 
AAGTC AGTGGG< 3 ■ 
(SEQ.ID.NO.12) 



FIGURE 6B 

SUBSTITUTE SHEET (RULE 26) 



WO 02/03793 



PCT/US01/21923 



10/33 



CATGAAGGTTCCTCCTGTCCTGCTTCTCTTTCOT 

TGAGCATCCCAGCATGGAGGCAGCCCTGACCGGGCCCAACGCCTCCTCGCACTTCTGGGCCAACTACACTTTCTC 
TGACTGGCAGAACTTCGTGGGCAGGAGACGTTATGGGGCCGAGTCCCAGAACCCCACGGTGAAAGCACTGCTCAT 
CGTGGCCTACTCATTCACCATCGTCTTC TCGCTC TTCGGTAATGTCC TGGTCTGTCATGTCATCTTCAAGAACCA 
GCGCATGCACTCGGCCACCAGCCTCTTCATTGTCAACCTGGCAGTGGCGGACATCATGATCACATTGCTCAACAC 
GCCCTTCACTTTGGTCCGCTTTGTGAACAGCAC^^ 

TQAGTACTGTTCTCTACATGTCTCAGCACTGACTCTGACAGCTATCGCAGTGGACCGCCACCAGGTCATCATGCA 
TCCACTGAAGCCTCGGATCTCCATCACCAAGGGTGTCATATATATTGCTGTCATCTGGGTCATGGCTACCTTCTO 
CTCTCTGCCACATGCCATCTGCCAGAAACTGTTTACC 

GCCGGACTTCCCGGAGCCAGCTGACCTCTTCTGGAAGTATCTGGACCTGGCCACCTTCATCCTGCTCTACCTACT 

TCCACTCTTCATTATCTCAGTGGCCTATGCTCGTGTGGCCAAGAAGCTGTGGCTCTGTAACACCATTGGCGACGT 

GACCACAGAGCAGTACCTCGCCCTGCGACGCAAGAAGAAGACCACCGTGAAGATGCTGGTGCTTGTGGTAGTCCT 

CTTTGCCCTCTGCTGGTTCCCTCTCAACTGCTATGTCCTCCTCTTGTCCAGCAAGGCCAT 

CCTCTACTTTGCCTTCCACTGGTTTGCCATC^ 

GAACTTTAGGGTTGAGCTTAAGGCATTGCTGAGCATGTGCCAAAGGCCACCCAAGCCGCAGGAAGACAGGCTACC 

CTCCCCAGTTCCTTCCTTCAGGGTGGCATGGACAGAGAA 

CTTGCCCTCTTCCCAGATCCAGTQTGGGAAGACAGATCTC 

AAAGCTGGAAGTTGGTGGGGGAGGGTTCTTTCCTCTCACAATTGACCAGACACTAACAGAGTTGGAAAGTAACAC 
AGAAGCAGTGAGATGCTTGGGTCCCTAGGAACCTGTCCAGC 

TGAGGTGGTGTGTGTAGATCTTTGAGCAAGAGCTCTGGAAACCACCTCAGCTTCAACAGAGGCTGGTCCAGTCAA 
CCACCTCCAATTGTGTAGCATCTGCCACCTTGCCCTTCCTACTGCTGAGCAACCACAGGGGGACTTGAGCCATAC 
TATTGGTGGGCCTGCCCCACATGCTCAGAAAAGAACAGGCACAAA 

TCACACAGCTTCAGTGACCTTGGCTCTATCCATGACCAGACAGGACCCATTTTGGCTTCOT 

ATTAGTATTGCCACTTTGAAAAGTTCAGAAAAGTAAAGAAATG^ 

AAGAAAAAAAAAAGAAAAAGAAAGAAAAAAGCCTGTTAATATGC^ 

TGTGTACATTTGTACTTTTAAAATCCT^ 

TATATCAGAGTATTATTGAATTC (SEQDDNO:13) 



MKVPPVLLLFLLSSVRATEQPQWTEHPSMEAAM^ 

VAYSFTIWSLFG3WLVCHVIFKNQRMHSATSLFIVNLAVADIMITLIOT 

QYCSLHVSALTLTAIAVDRHQVIIMPL^^ 

PDF P EPADLFWKYLDL ATF I LL YLLPLF 1 1 SVAYARVAKKLWLCNT I GDVTTEQYLALRRKKKTTVKMLVLVVVL 
FALCWFPLNCYVLLLSSKAIHTNN^ 

SPVPSFRVAWTEKSHGRI^PLPISrHHLPSSQIQSGKTDLSSVEPWAMS (SEQ ID NO: 14) 



FIGURE 7 
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Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



66G6TGGCAGTCGGCACCATCAGGCTCCCTT6GCGTTTC66AGTTTTCTCTGTGGTCCCG 

ACTCTCCGGAGGATCTCGGTTGTCTCCCAAGTCGGAACCTGGCACGGTCCAGGTTCACTC 

GGAGGTCCGGGCTTCCTCTGTGCCCCGTGCCCCTCGCTCCCAGGCTCCCTCTGTGGTGTG 

GACTCCTCTAGCCCGGTGCGCTCAGCCCCTCGCACCCAGCCTCCAGGCACAGAGCCCGGC 

AGGGAGCTCAGCCCTTGTGCCTAGAGCTGCAGTGGCTGGACATGAAGGTTCCTCCTGTCC 

TGCTTCTCTTTCTTCTGTCCTCAGTGCGAGCTACTGAGCAACCGCAGGTCGTCACTGAGC 

ATCCCAGCATGGAGGCAGCCCTGACCGGGCCCAACGCCTCCTCGCACTTCTGGGCCAACT 

ACACTTTCTCTGACTGGCAGAACTTCGTGGGCAGGAGACGTTATGGGGCCGAGTCCCAGA 

ACCCCACGGTGAAAGCACTGCTCATCGTGGCCTACTCATTCACCATCGTCTTCTCGCTCT 

TCGGTAATGTCCTGGTCTGTCATGTCATCTTCAAGAACCAGCGCATGCACTCGGCCACCA 

GC C TC TTC ATTGTC AACCTGGC AGTGGC GGAC ATCATGATC AC ATTGC TC AAC ACGC C C T 

TCACTTTGGTCCGCTTTGTGAACAGCACATGGGTGTTTGGGAAGGGCATGTGTCATGTCA 

GTCGCTTTGCTCAGTACTGTTCTCTACATGTCTC AGCACTGACTCTGACAGCTATCGCAG 

TGGACCGCCACCAGGTCATCATGCATCCACTGAAGCCTCGGATCTCCATCACCAAGGGTG 

TCATATATATTGCTGTCATCTGGGTCATGGCTACCTTCTTCTCTCTGCCACATGCCATCT 

GCCAGAAACTGTTTACCTTCAAGTACAGTGAGGACATTGTGCGCTCCCTCTGCCTGCCGG 

ACTTCCCGGAGCCAGCTGACCTCTTCTGGAAGTATCTGGACCTGGCCACCTTCATCCTGC 

TCTACCTACTTCCACTCTTCATTATCTCAGTGGCCTATGCTCGTGTGGCCAAGAAGCTGT 

GGCTCTGTAACACCATTGGCGACGTGACCACAGAGCAGTACCTCGCCCTGCGACGCAAGA 

AGAAGACC ACCGTGAAGATGCTGGTGCTTGTGGTAGTCCTCTTTGC CCTCTGCTGGTTC C 

CTCTCAACTGCTATGTCCTCCTCTTGTCCAGCAAGGCCATCCACACCAACAATGCCCTCT 

ACTTTGCCTTCCACTGGTTTGCCATGAGCAGTACTTGTTATAACCCCTTC^ 

GGCTCAATGAGAACTTTAGGGTTGAGCTTAAGGCATTGCTGAGCATGTGCCAAAGGCCAC 

CCAAGCCGCAGGAAGACAGGCTACCCTCCCCAGTTCCTTCCTTCAGGGTGGCATGGACAG 

AGAAGAGCCATGGTCGGAGGGCTCCACTACCTAATCACCACTTGCCCTCTTCCCAGATCC 

AGTCTGGGAAGAC AGATCTGTC ATC TGTGGAAC CCGTTGTGGC C ATGAGTTAGGGAAAGC 

TGGAAGTTGGTGGGGGAG^GTTCTTTCCTCTCACAATTGACCAGACACTAACAGAGTTGG 

AAAGTAACAC AGAAGCAGTGAGATGC TTGGGTTCCTAGGAACCTGTC C AGC C C C ATCTGA 

TTTGCAAACTTTCTAGAAGATGCCATGAGGTGGTGTGTGTAGATCTTTGAGCAA 

TGGAAACCACCTCAGCTTCAACAGAGGCTGGTCCAGTCAACCACCTCCAATTGTGTAGCA 

TCTGCCACCTTGCCCTTCCTACTGCTGAGCAACCACAGGGGGACTTGAGCCATACTATTG 

GTGGGCCTGCCCCACATGCTCAGAAAAGAACAGGCACAAAGGCTTTCTGAAGTCA 

ACAGGAATAATCACACAGCTTCAGTGACCTTGGCTCTATCCATGACCAGACAGGACCCAT 

TTTGGCTTCTTAAAAACAAAGAGAAATTAGTATTGC C AC TTTGAAAAGTTC AGAAAAGTA 

AAGAAATGAGTTCAGCCCTCAATTTGTAAAAAAAGGAAAAAAGAAAAAAAAAAGAAAAAG 

AAAGAAAAAAGCCTGTTAATATGC TGTAAATTTATC TGTAGCTTTGC CTTCTGTGTGTGT 

ACATTTGTACTTTTAAAATCCTGAACTACACGTGTCCATGTAGATTGTAATAA 

GAAACTGGAATATATCAGAGTATTATTGAATTC 



FIGURE 8A 
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Gene Sequence Structure ' 

Size of partial 
cDNA: 2253 bp 



299 bp 



Sequence Deleted 



753 bp 




: J» ILL-. 



Targeting Vector* 
(genomic sequence) 

Construct Number: 463 



Arm Length: 
5': 2.5 kb 
3': 0.6 kb 



LacZ-Neo 



* Not drawn to scale 




5 ■ >CTGGCACGGTCCAGGTTCACT 

CGGAGGCCCGGGCTTCCTCTGTGC 

CCCGTGCCCCTCGCTCCCTGGCTC 

CCTCTGTGGTGTGGACTCCTCTAG 

CCCGGTGCGCTCAGCCCCTCGCAC 

CCAGCCTCCAGGCACAGAGCCCGG 

CAGGGAGCTCAGCCCTTGTGCCTA 

GAGCTGCAGTGGCTGGACATGAAG 

GTTTCTCCTGT<3» 

(SEQIDNO:15) 



5 ■ >CAGCACTGACTCTGACAGCTA 

TCGCAGTGGACCGCCACCAGGTGA 

GAGCACCTGTCCCCAGCAGCATGC 

TCCCATCTCCGTCTATGCCTGGCT 

GGCTGGTGGGAATACTGCCACCAC 

GGTCTGTAGGGAATACTCTCAGGA 

C AGTGACTC ATTCAGTCCC GCTGA 

CAGCGTGTGTGCTTGCCTCCTTGT 

TGATCAATTTG<3 1 

(SEQIDNO:16) 



FIGURE 8B 

SUBSTITUTE SHEET (RULE 26) 



WO 02/03793 
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AAGCTTGTTCACAGGTGGCCATGACCTCACTGCCCCCTGGAACCACTGGGGACCCCGATTTGTTTTCTGGGCCGT 
CGCCAGCCGGCTCCACTCCAGCCAACCAGAGTGCAGAGGCTTCAGAGAGCAATGTGTCTGCGACGGTTCCCAGAG 
CTGCAGCAGTCACGCCGTTCCAGAGCCTGCAACTAGTGCACCAGCTGAAGGGACTGATCGTGATGCTGTACAGCA 
TCGTGGTGGTCGTGGGTCTGGTGGGCAACTGCCTTC 

TGACCAACTTCCTCATCGGCAACCTGGCCTTGTCCGATGTGCTCATGTGTGCCGCCTGTGTGCCTCTCACGCTGG 

CCTACGCCTTTGAACCTCGTGGCTGGGTGTTCGGTGGAGGCCTGTGCCACCTTGTTTTCTTCCTGCAGCCGGTCA 

CCGTCTACGTATCGGTGTTCACACTCACCACAATCGCTGTGGACCGCTATGTGGTTCTGGTGCACCCGCTACGTC 

GGCGCATTTCACTGAAGCTCAGCGCCTACGCTGTGCTGGGCATCTGGGCTCTATCTGCAGTGCTGGCGCTGCCGG 

CCGCGGTGCACACCTACCATGTAGAGCTCAAGCCCCACGACGTGCGCCTCTGCGAGGAGTTCTGGGGTTCGCAGG 

AGCGCCAGAGACAGATCTATGCCTGGGGGCTGCTGCTGGGCACCTATTTGCTCCCCCTGCTGGCCATTCTCCTGT 

CTTAC GTC CGGGTGTCGGTGAAGTTGCGGAACCGCGTGGTGCCTGGC AGCGTGACCCAGAGCC AGGCTGACTGGG 

ACCGAGCGCGTCGCCGTCGCACTTTCTGCCTGCTGGTGGTGGTGGTGGTCGTGTTCGCGCTCTGCTGGC 

TGCACATTTTCAACCTTCTGCGGGACCTGGACCCGCGTGCCATCGACCCCTACGCCTTCGGGCTGGTGCAGCTCC 

TCTGCCACTGGCTTGCCATGAGCTCCGCCTGCTACAACCCCTTCATCTATGCGTGGCTGCACGACAGCTTCCGAG 

AGGAGCTACGCAAGATGCTTCTGTCTTGGCCCCGCAAGATCGTGCCTCATGGCCAGAATATGACCGTCAGTGTGG 

TCATCTGATGAGACTCTGCCAGGTCTGAGCTGGGGAGCT^ 

TATTCCCATTTGCCAACACAAGTGAAATTCCTACCCAGCCCTCTAGACTAGCTATATGTCTGTTTGTGTGCTTGT 
GTGAAATGTTAGATAGGCTTTGGAGTAGAAGAGCCTGAGGAGAGAGAAGCAGATTTATCAGGTCTACCCCTGAAA 
GCAAAAGAGATATTTTTCTGGGTTCAGAAAAAATAAATCAGAAGC^ 
TCTCTCTCTCTCTCTNNCTTTCTTTCT (SEQIDNO:17) 



MTSLPPGTTGDPDLFSGPSPAGSTPANQSAEASESNVSAWPRAAAWPFQSLQLVHQLKGLIVMLYSIVVVVGL 
VGNCLLVLVI ARVRRLHNVTNFL I GNLALSDVIjMCAACVPLTLAYAFEPRGVA^GGGLCHLVFFLQPVTVWSW 
TLTTIAVDRYVVLWPLRRRISLKLSAYAVLGIWALSAVLALPAAWTYHV^ 
AWGLLLGTYLLPLIAILLSYVRVSVKLRNRWPGSVTQSQADWDRARRRRTFCL^ 

RDLDPRAIDPYAFGLVQLLCHWLAMSSACYOT (SEQ 
IDNO:18) 



FIGURE 9 
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Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



AAGCTTGTTCACAGGTGGCCATGACCTCACTGCCCCCTGGAACCACTGGGGACCCCGATT 
TGTTTTCTGGGCCGTCGCCAGCCGGCTCCACTCCAGCCAACCAGAGTGCAGAGGCTTCAG 
AGAGCAATGTGTCTGCGACGGTTCCCAGAGCTGCAGCAGTCACGCCGTTCCAGAGCCTGC 
AACTAGTGCACCAGCTGAAGGGACTGATCGTGATGCTGTACAGCATCGTGGTGGTCGTGG 
GTCTGGTGGGCAACTGCCTTCTTGTGCTGGTG ATCGCGCGCGTGCGCCGGCTGCACAACG 
TGACCAACTTCCTCATCGGCAACCTGGCCTTGTCCGATGTGCTCATGTGTGCCGCCTGTG 
TGCCTCTCACGCTGGCCTACGCCTTTGAACCTCGTGGCTGGGTGTTCGGTGGAGGCCTGT 
GCCACCTTGTTTTC1TCCTGCAGCCGGTCACCGTCTACGTATC GGTGTTCACACTCACCA 
CAATCGCTGTGGACCGCTATGTGGTTCTGGTGCACCCGCTACGTCGGCGCATTTCACTGA 
AGCTCAGCGCCTACGCTGTGCTGGGCATCTGGGCTCTATCTGCAGTGCTGGCGCTGCCGG 
CCGCGGTGCACACCTACCATGTAGAGCTCAAGCCCCACGACGTGCGCCTCTGCGAGGAGT 
TCTGGGGTTCGCAGGAGCGCCAGAGACAGATCTATGCCTGGGGGCTGCTGCTGGGCACCT 
ATTTGCTCCCCCTGCTGGCCATTCTCCTGTCTTACGTCCGGGTGTCGGTGAAGTTGCGGA 
ACCGCGTGGTGCCTGGCAGCGTGACCCAGAGCCAGGCTGACTGGGACCGAGCGCGTCGCC 
GTCGCACTTTCTGCCTGCTGGTGGTGGTGGTGGTCGTGTTCGCGCTCTGCTGGCTGCCTC 
TGCACATTTTCAACCTTCTGCGGGACCTGGACCCGCGTGCCATCGACCCCTACGCCTTCG 
GGCTGGTGCAGCTCCTCTGCCACTGGCTTGCCATGAGCTCCGCCTGCTACAACCCCTTCA 
TCTATGCGTGGCTGCACGACAGKTTTCCGAGAGGAGCTACGCAAGATGCTTCTGTCTTGGC 
CCCGCAAGATCGTGCCTCATGGCCAGAATATGACCGTCAGTGTGGTCATCTGATGACACT 
CTGCCAGGTCTGAGCTGGGGAGCTTCAGAGTCATTAGCCCCGGAAGTCATCATGAGATCT 
TATTCCCATTTGCCAACACAAGTGAAATTCCTACCCAGCCCTCTAGACTAGCTATATGTC 
TGTTTGTGTGCTTGTGTGAAATGTTAGATAGGCTTTGGAGTAGAAGAGCCTGAGGAGAGA 
GAAGC AGATTTATC AGGTC TACCCC TGAAAGC AAAAGAGATATTTTTC TGGGTTC AGAAA 
AAATAAATCAGAAGCAGTCTTTCTTTCCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 
NNCTTTCTTTCT 



FIGURE 1 0A 
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Gene Sequence Structure * 274 bp Sequence Deleted 462 bp 

Size of full-length fe^:.; C;F 

cDNA: 1452 bp 



Targeting Vector* 
(genomic sequence) 

Construct Number: 262 



Arm Length: 
5': 5.3 kb 
3': 1 kb 



Neo 



i ,1-" — i — J r- -,. Endogenous Lofcusj; 

* Not drawn to scale 




5 • >GTTGCCAGCCAGCTCCACTCC 

AGCCAACCAGAGCGCAGAGGCCTC 

GGAGGGCAACCTGTCTGCGACGGT 

TCCCCGAGCTGCAGCAGTCACGCC 

GTTCCAGAGCCTGCAGCTGGTGCA 

CCAGCTGAAGGGGTTGATCGTGAT 

GCTGTACAGCATTGTGGTGGTCGT 

GGGTCTGGTGGGCAACTGCCTTCT 

TGTGCTGGTGAO 1 

(SEQIDNO:19) 



5 ' >CGGTGTTCACACTCACCACAA 

TCGCCTTGGACCGCTATGTGGTTC 

TGGTGCACCCGCTACGTCGGTGCA 

TTTCACTGAGGCTCAGCGCCTACG 

CGGTGCTGGGCATCTGGGCTCTAT 

CTGCAGTGCTGGCGCTGCCGGCCG 

CGGTGCACACCTACCATGTGGAGC 

TCAAGCCCCACGACGTGCGCCTCT 

GCGAGGAGTTC < 3 ■ 

(SEQIDNO:20) 



FIGURE 10B 

SUBSTITUTE SHEET (RULE 26) 



WO 02/03793 



PCT/US01/21923 
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GGGACAGTGGGTCCCAATGGCTCTAGGGTCCTCCTGTGTAGCTC 

CTTCCAACAGGATATAGGACCTGGTGAGCCTTTGTCTCTCTGCATAGGGACAGTGACTGTGTCCATCACAGAGGC 
TGTTTAGGGCATAGAAGTAGGTTACTGCCTTGAACCTCTGACACTAATCTTTTCCCACAGGACAA 
GGCTCTCCTCACTGAGCAGTGGTTCTCCCCCTGGAATCCCAGTGTGAGGACCGAGATGGCTCTGAGCCTGGAGTC 
TACAACAAGCTTTCATATGCTCACCGTGTCCGGAAGCACTGTGA 

CAACAGTTCCTGGTCCGGCCCAACAGATCCCAGCTCCCTGAAAGACCTTGTGGCCACGGGTGTCATCGGGGCAGT 
GCTCTCAGCCATGGGTGTGGTGGGCATGGTGGGAAATGTATACACT^^ 

CTCGGCCTCCATGTACGTCTATGTGGTCAACCTAGCGCTGGCTGATCTGCTGTACCTGCTGAGCATTC 

CATAGCCACCTACGTCACTAAGGACTGGCACTTTGGAGATGTGGGCTGCAGAGTCCTCTTTAGCCTGGACTTCCT 

GAC^TGCACGCCAGCATCTTCACCCTGACCATAATGAGCAGCGAACGCTATGCAGCCGTACTGAGGCCTCTG^A 

CACAGTCCAGCGCTCCAAGGGTTACCGTAAGCTGCTGGTGCTGGGCACCTGGTTGCTGGCACTGCTGCTGACCCT 

ACCCATGATGCTTGCCATCCAGCTGGTCCGCAGGGGCTCTAAGAGCCTCTGCCTGCCAGCCTGGGGCCCTCGTGC 

CCACCGTACTTACCTAACGTTGCTCTTTGGGACCAGC^^ 

CCGTCTGGCCAGGGCCTACTGGCTATCTCAGCAAGCTTCTTTCAAGCAGACACGGCGGCTGCCCAACCCCAGGGT 
GCTCTACCTCATCCTTGGTATCGTCCTTCTCTTCTG 

CCAGTACCACGAGGCCATGCCACTGACTCCCGAGACTGCACGCATTGTCAACTACCTGACCACCTGCCTCACTTA 
TGGCAACAGTTGCATCAATCCCTTGCTCTACACTCTGCTCACCAAGAACTATCGAGAGTACCTACGTGGCCGCCA 
GCGGTCACTGGGTAGTAGTTGCCACAGCCCAGGGAGTCCTGGCAGCTTCCTGCCCAGCCGAGTCCACCTCCAGCA 
GGACTCGGGCCGCTCGCTGTCCTCCAGCAGCCAACAGGCCACAGAGACCCTCATGCTGTCTCCAGTCCCCCGTAA 
CGGGGCCCTTCTCTGAGAGTGCACTGTGCAATCCTGGCATAGGAAAGGACCCAAAGGCGTGCGGCTCCGGAGCGC 
ATTTCCCAGAATCCCCTGCTCAAACCTAACTGGCTCGTC (SEQ E>NO:21) 



MALSLESTTSFHMLWSGSTVTELPGDSNVSLNS SWSGPTDPS SLKDLVATGVIGAVLSAMGVVGMVGNVYTLVV 
MCRFLRASASMYVYWNLALADLLYLLSIPFIIATY^ 
AVLRPLDTVQRSKGYRKLLVLGTWLLALLLTLPMMIiAIQLVRRGSKSI^ 
VIGIiLYVRLARAYVtoSQQASFKQTRRL^ 

LTTCLTYGNSCINPLLYTLLTKI^REYLRGRQRSLGSSCHSPGSPGSFLPSRVHLQQDSGRSLSSSSQQATETIjM 
LSPVPRNGALL (SEQIDNO:22) 



FIGURE 11 



WO 02/03793 
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PCT/US01/21923 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



GGGACAGTGGGTCCCAATGGCTCTAGGGTCCTCCTGTGTAGCTGGGGAGATAACAAAAAA 
GGGATTCTTTTGAGGCTTCCAACAGGATATAGGACCTGGTGAGCCTTTGTCTCTCTGCAT 
AGGGACAGTGACTGTGTCCATCACAGAGGCT6TTTAGG6CATAGAA6TA6GTTACTGCCT 
TGAACCTCTGACACTAATCTTTTCCCACAGGACAAGTTTCCCACGGGCTCTCCTCACTGA 
GCAGTGGTTCTCCCCCTGGAATCCCAGTGTGAGGACCGAGATGGCTCTGAGCCTGGAGTC 
TACAAC2UIGCTTTCATATGCTCACCGTGTCCGGAAGCACTGTGACTGAGCTGCCTGGTGA 
CTCCAACGTGTCCCTCAACAGTTCCTGGTCCGGCCCAACAGATCCCAGCTCCCTGAAAGA 
CCTTGTGGCCACGGGTGTCATCGGGGCAGTGCTCTCAGCCATGGGTGTGGTGGGCATGGT 
GGGAAATGTATACACTTTGGTGGTCATGTGCCGGTTTCTGCGTGCCTCGGCCTCCATGTA 
CGTCTATGTGGTCAACCTAGCGCTG GCTGATCTGCTGTACCTGCTGAGCATTCCCTTCAT 
CATAGCC ACCTACGTCACTAAGGACTGGCACTTTGGAGATGTGGGCTGCAGAGTCCTCTT 
TAGCCTGG ACTTCCTGACAATGCACGCCAGCATCTTCACCCTGACCATAATGAGCAGCGA 
ACGCTATGCAGCCGTACTGAGGCCTCTGGACACAGTCCAGCGCTCCAAGGGTTACCGTAA 
GCTGCTGGTGCTGGGCACCT GGTOGCTH^^ 

CATCCAGCTGGTCCGCAGGGGCTCTAAGAGCCTCTGCCTGCCAGCCTGGGGCCCTCGTGC 

CCACCGTACTTACCTAACGTTGCTCTTTGGGACCAGCATTGTGGGGCCT 

TGGGCTGCTCTATGTCCGTCTGGCCAGGGCCTACTGGCTATCTCAGCAAGCTTCTTTCAA 

GCAGACACGGCGGCTGCCCAACCCCAGGGTGCTCTACCTCATCCTO 

CTTCTGGGCCTGCTOTCTACCCTTCTGGCTGTGGC^ 

CATGCCACTGACTCCCGAGACTGCACGCATTGTCAACTACCTGACCACCTGCCTCACTTA^ 

TGGCAACAGTTGCATCAATCCCTTGCTCTACACT CTGC TCACC AAGAAC TATCGAGAGTA 

CCTACGTGGCCGCCAGCGGTCACTGGGTAGTAGTTGCCACAGCCCAGGGAGTCCTGGCAG 

CTTCCTGCCCAGCCGAGTCCACCTCCAGCAGGACTCGGGCCGCTCGCTGTCCTCCAGCAG 

CCAACAGGCCACAGAGACCCTCATGCTGTCTCCAGTCCCCCGTAACGGGGCCCTTCTCTG 

AGAGTGCACTGTGCAATCCTGGCATAGGAAAGGACCCAAAGGCGTGCGGCTCCGGAGCGC 

ATTTCCCAGAATCCCCTGCTCAAACCTAACTGGCTCGTC 



FIGURE 12A 
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Gene Sequence Structure * 566 bp Sequence Deleted 798 bp 



Size of CDS: 1539 bp 





* Not drawn to scale 



5 ■ >ATGTGTCCCACAACAGTTCCT 

GGACTGGCCCAACAGATCCCAGCT 

CCCTGCAAGACCTTGTGGCCACCG 

GTGTCATCGGGGCAGTGCTCTCAA 

CCATGGGTGTGGTGGGCGTGGTGG 

GCAATGTGTACACTCTGGTGGTCA 

TGTGCCGATTTCTGCGTGCCTCGG 

CCTCCATGTACGTCTATGTGGTCA 

ACCTAGCGCTGO " 

(SEQK>NO:23) 



5 1 > CTGGTTGCTGGCACTGCTGCT 

GACCTTACCCATGATGCTTGCCAT 

CCGGCTGGTC CGTAGGGGCTCTAA 

GAGCCTCTGCCTGCCAGCCTGGGG 

CCCTCGTGCCCACCGTACCTACCT 

GACGCTGCTCTTTGGGACCAGCAT 

TGTGGGGCCTGGCCTGGTCATTGG 

GCTGCTCTATATCCGTCTGGCCAG 

GGCCTATTGGC < 3 1 

(SEQIDNO:24) 



FIGURE 12B 



SUBSTITUTE SHEET (RULE 26) 
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AAGCTTCCTTGAG&TTTCTAAAATTTATACAAAAACA^ 

tttggtgatggacccagaagaaacttcagtttatttggattattactatgctacgagcccaaactctgacatcag 
ggagacccactcccatgttccttacacctctgtcttccct 

GCTGGGGAACCTTGTTCTCATGGGAGCGTTGCATTTCAAACCCGGCAGCCGAAGACTGATCGACATCTTTATCAT 

CAATCTGGCTGCCTCTGACTTCATTTTTCTTGTCACATTGCCTCTCTGGGTGGATAAAGAAGCATC 

GTGGAGGACGGGCTCCTTCCTGTGCAAAGGGAGCTCCTACATGATCTCCGTCAATATGCACTGCAGTGTCCTCCT 

GCTCACTTGCATGAGTGTTGACCGCTACCTGGCCATTGTGTGGCCAGTCGTATCCAGGAAATTCAGAAGGACAGA 

CTGTGCATATGTAGTCTGTGCCAGCATCTGGTTTATCTCCTGCCTGCTGGGGTTGCCTACTCTTCTGTCCAG 

GCTCACGCTGATTGATGATAAGCCATACTGTGCAGAGAAAAAGGCAACTCCAATTAAACTCATATGGTCCCTGGT 

GGCCTTAATTTTCACCTTTTTTGTCC^ 

TGC CCATTAC C AGC AATC AGGAAAGC AC AACAAAAAGCTGAAGAAATCTATAAAGATC ATC TTTATTGTC GTGGC 
AGCCTTTCTTGTCTCCTGGCTGCCCTTCAATACTT^ 

CTATTTACCCTCAGCTATTCTTCAGCTTGGTATGGAGGTGAGTGGACCCTTGGCATTTGCCAACAGCTGTGTCAA 
CCCTTTCATTTACTATATCTTCGACAGCT 

CTATGACTTTGGGAGTAGCACTGAGACATCAGATAGTCACCTCACTAAGGCTC 
AGATTTTGCCAGGAGGAGG ^ GAGGTCTGTGTCACTC 

TTAGGAGTTAATTTTTGTCAGCAACAAAGAAA (SEQ ID NO:25) 



MDPEETSVYLDYYYATSPNSDIRETHSWPYTSWL^ 

AASDF I FLVTL PLWVDKEASLGLWRTGS FLC KGS S YMI SVNMHC SVLLLTCMSVDRYL ATVWPWSRKFRRTDC A 
YWCASIWFISCLLGLPTLLSRELTLIDDKPYCAEKKATPIKLIWSLVALIFTFFVPLLSTVTC 
YQQSGKHNKKLKKSIKIIFIWAAFLVSWLPFNTFKFLAIVSGLRQEHYLPSAILQLGMEVSG 
IYYIFDSYIRRArVHCLCPCLKNYDFGSSTETSDSHLTKALSTFIHAEDFARRRKRSVSL (SEQ ID NO:26) 



FIGURE 13 



WO 02/03793 



PCT/US01/21923 



20/33 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



AAGC TTCC TTGAGGTTTC TAAAATTTATAC AAAAAC ATCATATGTAAGTAAACTC ACCAG 

ATTTGGCATCTGCTCTTTGGTGATGGACCCAGAAGAAACTTCAGTTTATTTGGATTATTA 

CTATGCTAC GAGCCCAAACTCTGACATCAGGGAGACCCACTCCCATGTTCCTTACAC CTC 

TGTCTTCCTTCCAGTCTTTTACACAGCTGTGTTCCTGACTGGAGTGCTGGGGAACCTTGT 

TCTCATGGGAGCGTTGCATTTCAAACCCGGCAGCCGAAGACTGATCGACATCTTTATCAT 

CAATCTGGCTGCCTCTGACTTCATTTTTCTTGTCACATTGCCTCTCTGGGTGGATAAAGA 

AGGATCTCTAGGACTGTGGAGGACGGGCTCCTTCCTGTGCAAAGGGAGCTCCTACATGAT 

CTCCGTCAATATGCACTGCAGTGTCC TCCTGCTCACTTGCATGAGTGTTGACCGCTACCT 

GGCCATTGTGTGGCCAGTCGTATCCAGGAAATTCAGAAGGACAGACTGTGCATATGTAGT 

CTGTGCCAGCATCTGGTTTATCTCCTGCCTGCTGGGGTTGCCTACTCTTCTGTCCAGGGA 

GCTC AC GC TGATTGATGATAAGC C ATACTGTGC AGAGAAAAAGGC AACTCC AATTAAAC T 

CATATGGTCCCTGGTGGCCTTAATTTTCACCTTTTTTGTCCCTTTGTTGAGCATTC 

CTGCTACTGTTGCATTGCAAGGAAGCTGTGTGCCCATTACCAGCAATCAGGAAAGCACAA 

CAAAAAGCTGAAGAAATCTATAAAGATCATCTTTATTGTCGTGGCAGCCTTTCTTGTCTC 

CTGGCTGCCCTTCAATACTTTCAAGTTCCTGGCCATTGTCTCTGGGTTGCGGCAAGA^ 

CTATTTACCCT(^GCTATTCTTCAGCTTGGTATGGAGGTGAGTGGACCCTTGGCATTT^ 

CAACAGCTGTGTCAACCCTTTCATTTACTATATCTTCGACAGCTACATCCGCCGGGCCAT 

TGTCCACTGCTTGTGCCCTTGCCTGAAAAACTATGACTTTGGGAGTAGCACTGAGACATC 

AGATAGTC AC C TC ACTAAGGCTC TCTC CACCTTC ATTC ATGC AGAAGATTTTGCC AGGAG 

GAGGAAGAGGTCTGTGTCACTCTAAAGGGAACTGTGACATTTCAAGCTCTGTTGGTGGGT 

TTAGGAGTTAATTTTTGTCAGCAACAAAGAAA 



FIGURE 14A 



WO 02/03793 



21/33 



PCT/USO 1/21923 



Gene Sequence Structure * 448 bp Sequence Deleted 746 bp 

Size of CDS: 1 232 bp | M£ . — H^PMf", j£ ■ ' ?, J 



Targeting Vector* 
(genomic sequence) 

Construct Number: 2454 



Arm Length: 
5': 3.2 kb 
J: 1 kb 



LacZ-Neo 




* Not drawn to scale 



5 " >ATAGCTCTGCATTTCAAACGC 
GGC AAC C GAAGATTGATC GAC ATC 
TTTATCATCAACCTGGCTGCCTCT 
GACTTCATTTTCCTTGTCACAGTG 
C C TC TTTGGATGGATAAG G AAGCC 
TCTCTAGGATTATGGAGGACTGGC 
TC TTTC C TGTGC AAAGGC AGC TC C 
TATGTGATCTCCGTGAACATGCAC 
TGTAGTGTCTTO 1 
(SEQIDNO:27) 



5 f > TGTGTGC TC ATT AC C AGC AGT 

C GGGAAAGC ATAACAAGAAACTAA 

AGAAGTCCATAAAGATCGTTATTA 

TTGCGGTGGCGGCCTTCACCGTCT 

CCTGGGTACCCTTTAACACTTTCA 

AGCTCCTAGCCATTGTTTCAGGGT 

TCCAGCCAGAAGGCCTTTTTCACT 

C C GAGGC TCTGC AGCTGGC C ATGA 

ATGTGACTGGG<3 1 

(SEQIDNO:28) 



FIGURE 14B 

SUBSTITUTE SHEET (RULE 26) 



WO 02/03793 



22/33 



PCT7US01/21923 



GAGAAGTCAAACTTGGTATACCGGGAGAGGGAGAGATGAAAACAACTTCTCAGTTTCTCTGTGGCTGACAGAAGC 

TCCTGAGAGAGGCTTTCAGAACTTGATTCCAGCTGGCCCGTGAAAGCTGTGTCTGGACGGGAGAGCCTCAGAGGA 

ATAAACAGTGCCTTCTCTGGCTTCTCCGGGTCAAGACCAGTCTGTTCAGAAGGAAGTGGTGTTTCTCTC 

CTGTTCCCACAGCCCCCTGGATGGTAGATCATGGGGCTGCACCCTTCTCCACCATTGGGAACTCAG 

TATCTTTATCTCGCCAGCTTGTCCTCGTGCTTGACCAGCGCAGTTTCACTTTTGCAAACATC 

TGAGAGAAAAATATCAAGCAACCTGCCTCAAACGACCCTGTTTTGGCCGGTGAGCAAGGACAGCCTCCGATGGAT 

AACTACACAGTGGCCCCGGACGATGAATATGATGTCCTAATCTTAGACGACTACCTGGACAACAGTGGGCCGGAC 

CAAGTTCCGGCCCCCGAGTTCCTCTCCCCCCAGCAGGTGCTGCAGTTCTGCTGCGCGGTGTTTC 

TTGGACAACGTGCTGGCGGTGTTTATCTTGGTGAAATACAAAGGACTCAAGAATCTGGGGAACATCTACTTCCTA 

AACCTGGCACTTTC AAACCTGTGTTTCCTGCTTCC CCTGC CGTTCTGGGC CCATACTGC AGC ACACGGGGAAAGC 

CCTGGCAACGGGACCTGTAAAGTTCTTGTCGGACTCCACTCCTCGGGCTT^ 

CTCCTCCTTGTGCAAGGATACAGGGTGTTTTCCCAAGGGCGACTGGCCTCCATCTTCACGACAGTGTCTTGTGGT 
ATTGTTGCGTGCGTCCTGGCATGGGCCATGGCTACTGCGCTCT^ 

ATGGAAAGACAGAAACACAAGTGTGCCTTTGGCAAACCTCACTTCTTGCCAATCGAAGCGCCGCTCTGGAAGTAC 
GTTCTGACGTGAAAAATGATGATCTTGGTACTTC 

AGGAGAAGGC AGAGCTTC AGGG AGAGAC AGTAC GACC TC C AC AAGCCGGC TCTTGTC ATAACGGGC GTGTTC CTT 
TTGATGTGGGCGCCTTACAACACTGTGCTTTTCCTGTCTC 

AGCAGCTACCACCTGGACGCAAGTGTTCAGGTCACACAGCTGGTAGCGACCACCCACTGCTGCGTCAACCCGCTG 

CTCTATTTGCTTCTTGACCGGAAGGCCTTTATGAGATACCTTCGCAGCCTGTTCCCACGGTC 

TATCAAAGTAGTGGAGGCTCTCAGCAAGCGCCGCCAAGGAAAGGTCATGGCAGGCCCATTGAACTGTACAGC^ 

TTGC ATC AAAGGCAGG ATATAATATAATATAAACAC TC AACC TTGTC TCTTC AC ATTGTTTTATGC C ATTTTTAT 

ATGTTTGTATAAAAATAGAATATAAGAGGAAAGGGGGCTGGTTAATGAACATTTACTA 

CTCTGGCACACGGTTAAGTTATTTATAAAGGTGAATTCGACTCTTC 

TGGGGCGGTGTGCATTCCATAAACTGAAGCTTGGTAGTTGTCCAAAGTTACATAATTATAAAGTGGCAAAACTC 

TGTGTAAACTCAGAGATCACTGGCCCCAGAGTCTGTGAGTTATCCAAGCGTGCCAGCCTTTCAGTAACTGTCAGC 

TAGCAGTGGTGGCATGTCTGTCATCCCAGCACTCTAGGAG 

ACCTGACTTCCCCCAACCTCAATTCTTTTTTTTTTTGTTGTTG 

TATTTTTAAAGCAAAAAAAA (SEQIDNO:29) 



MDimVAPDDEYDVLILDDYLDNSGPDQVPAPEFLSPQQ 

FLNLALSNLC FLLPLPFWAHTAAHGES PGNGTCKVLVGLHS SGLYS EVFSNI LLLVQGYRVFSQGRIiAS I FTTVS 
CGrVACVLAWAMATALSLPESVFYTSPRME 

QLRRRQSFRERQYDLHKPALVITGVFLLMWAPYNTVLFLSAFQEHLSLQDEKSSYHLDA 
PLLYLLLDRKAFMRYLRSLFPRCNDIPYQSSGGSQQAPPRKGHGRPIELYSNLHQRQDII (SEQ ID NO:30) 



FIGURE 15 



WO 02/03793 



PCT7US01/21923 



23/33 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



GAGAAGTCAAACTTGGTATACCGGGAGAGGGAGAGATGAAAA.CAACTTCTCAGTTTCTCT 
GTGGC TGAC AGAAGCTCCTGAGAGAGGCTTTCAGAACTTGATTCCAGCTGGCC CGTGAAA 
GCTGTGTCTGGACGGGAGAGCCTCAGAGGAATAAACAGTGCCTTCTCTGGCTTCTCCGGG 
TCAAGACCAGTCTGTTCAGAAGGAAGTGGTGTTTCTCTGACCCCACTGTTCCCACAGCCC 
CCTGGATGGTAGATCATGGGGCTGCACCCTTCTCCACCATTGGGAACTCAGTCAAACGGT 
TATCTTTATCTCGCCAGCTTGTCCTCGTGCTTGACCAGCGCAGTTTCACTTTTGCAAACA 
TCTGTTTATATCCCTTGAGAGAAAAATATCAAGCAACCTGCCTCAAACGACCCTGTTTTG 
GCCGGTGAGCAAGGACAGCCTCCGATGGATAACTACACAGT GGCCCCGGACGATGAATAT 
GATGTCCTAATCTTAGACGACTACCTGGACAACAGTGGGCCGGACCAAGTTCCGGCCCCC 
GAGTTCCTCTCCCCCCAGCAGGTGCTGCAGTTCTGCTGCGCGGTGTTTGCGGTGGGTCTC 
TTGGACAACGTGCTGGCGGTGTTTATCT TGGTGAAATACAAAGGACTCAAGAATCTGGGG 
AACATCTACTTCCTAAACCTGGCACTTTCAAACCTGTGTTTCCTGCTTCCCCTGCCGTTC 
TGGGCCCATACTGCAGCACACGGGGAAA GCCCTGGCAACGGGACCTGTAAAGTTCTTGTC 
GGACTCCACTCCTCGGGC TTATACAGCGAGGTGTTTTCCAACATCCTCCTCCTTGTGCAA 
GGATACAGGGTGTTTTCC CAAGGGCGACTGGCCTCCATCTTCACGACAGTGTCTTGTGGT 
ATTGTTGC GTGC GTCC TGGC ATGGGC CATGGgTACTCC'GfVPfyrr'T^rT^r r r r: & grypfiTn 

TTTTATGAGCCTCGGATGGAAAGACAGAAACACAAGTGTGCCTTT 

TTGCCAATCGAAGCGCCGCTCTGGAAGTACGTTCTGACGTCAAAAATG 

CTTGCTTTTCCTCTGCTGGTTTTTATAATCTGCTGCA 

TTCAGGGAGAGACAGTACGACCTCCACAAGCCGGCTCTTGTCATAACGGGCGTGTTCCTT 
TTGATGTGGGCGCCTTAGAACACTGTGCTTTTC 

CTGCAGGATGAGAAGAGCAGCTACCACCTGGACGCAAGTGTTCAGGTCACACAGCT 

GCGACCACCCACTGCTGCGTCAACCCGCTGCTCTATTTGCTTCTTGACCGGAAGGCCTTO 

ATGAGATACCTTCGCAGCCTGTTCCCACGGTGCAATGATATCCCCTATCA^ 

GGCTCTCAGGAAGCGCCGCCAAGGAAAGGTCATGGCAGGCCGATTGAAC^ 

TTGCATCAAAGGCAGGATATAATATAATATAAACACTCAACCTTGTCT 

TTATGCCATTTTOATATGTCTGTATAAAAATAGAATAT^ 

TGAACATTTACTACTAAGCGTTGAATTTGTCTCTGGCACACGGTTAAGTTA 

GTGAATTCGACTCOTCCTCTTTAG^ 

TCCATAAACTGAAGCTTGGTAGTTGTCCAAAGTTAC^^ 

TGTGTAAACTCAGAGATCACTGGCCCCAGAGTCTGTGAGTTATC CAAGCGTGCC AGC CTT 
TC AGTAAC TGTC AGCTAGCAGTGGTGGCATGTC TGTC ATCC C AGCACTCTAGGAGGC GGA 
GGC AGGAGGCATC TCAC AAATGC AAGTATAACC TGAC TTC CC C C AACC TC AATTCTTTTT 
TTTTTTGTTGTTGTTGTTCATCAAAATACATGCAATAAATATATATATT^ 
AAAAA 



FIGURE 16A 



WO 02/03793 
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Gene Sequence Structure * 463 bp Sequence Deleted 925 bp 



Size of full-length 
cDNA: 2045 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 285 



Arm Length: 
5': 1.3 kb 
3': 3.5 kb 



Neo 





* Not drawn to scale 



5 • >GACTCTTTCTGTACCAGGCAG 

TTTCTGTCACATTAGCAGCNCGGT 

CCCTGGGTGCTTAATGAGGAGATT 

TGAGTACGGCCGCCCCCGCTGGGG 

GTGGGGGGTGGGGGCATAGTTACT 

TTCAGTCTTTGGCTGTCGGATGGA 

GGGGAATCATGACCTCTGGTTTCC 

CCACAGGACAGCCTCCGATGGATA 

ACTACACAGTG<3 1 

(SEQIDNO:31) 



5 1 >CCATGGCTACTGCGCTCTCTT 

TGCCCGAGTCTGTGTTTTATGAGC 

CTCGGATGGAAAGACAGAAACACA 

AGTGTGCCTTTGGCAAACCTCACT 

TCTTGCCAATCGAAGCGCCGCTCT 

GGAAGTACGTTCTGACGTCAAAAA 

TGATCATCTTGGTACTTGCTTTTC 

CTCTGCTGGTTTTTATAATCTGCT 

GCAGGC AACTG< 3 1 

(SEQIDNO:32) 



FIGURE 16B 

SUBSTITUTE SHEET (RULE 26) 



WO 02/03793 



25/33 



PC17US01/21923 



GATTCTTTAGCATGTTGGCTGCTGGAGAAAAGTTCAGGGGCAGGCGACAAGGTCCGGGTGCTGAGCAAGGTGGCT 

GGACAAGCCTAGCGGGAGAGAAACCTGAGGCCACGAGCGACCTCCGGAGGAGCCCCTAGACGGGAGTCTTAGCTG 

AGACACGGC AGC ATGCAACCAGCCCAGATGCGCCTTGCAGAACGAGAGCC TATTTTCAACACTCOT C CGCAGTC 

GGGCCTGCACGCCTTCAGGAACCCACCACCACTCTCCCAATTGTTCCCTGAAATGAACGTTC 

TCGTGGACTGTTGAAGTAACCATCTGTCTCTAGGAAGCCATGGCAACCACGCATGCGCAGGGCCACCAGCCAGTC 

TTGGGAAATGACACTCTCCGGGAACATTACGATTACGTGGGGAAACTGGCAGGCAGGCTGCGGGACCCCCCTGAG 

GGTGGCACCCTCATCACCACCATCCTCTTCTTGGTCACCTGTAGCTTCATCGTCTTGGAGAACCTGATGGT 

ATTGCCATCTGGAAAAACAATAAATTTCACAACCGCATGTACTTTTTCATCGGCAACTTGGCTCT 

CTGGCCGGCATAGCATACAAGGTCAATATTCTTATGTCCGGCAGGAAGACGTTCAGTCTGTCTCCAACAGTGTGG 

TTCCTCAGGGAGGGCAGTATGTTCGTAGCCCTGGGCGCATCCACCTGCAGCTTACTGGCCATTGCCATTGAGCGA 

CACCTGACCATGATCAAGATGAGGCCATATGATGCCAACAAGAAGCACCGCGTGTTCCTTCTGATTGGGATGTGC 

TGGCTAATTGCCTTCTCGCTGGGTGCCCTGCCAATCCTGGGCTGGAACTGCCTGGAGAACTTTCCCGACTGCTCT 

ACCATCTTGCCCCTCTACTCCAAGAAATACATCGCCTTCCTCATCAGTATCTTCACCGCCATTCTG 

GTCATCTTGTATGCGCGCATCTACTGCCTGGTCAAGTCCAGCAGCCGCAGGGTGGCCAACCACAACTCTGAGAGA 

TCCATGGCCCTTCTGCGGACCGTAGTGATTGTGGTGAGTGTGTTCATTGCCTGTTGGTCCCCACTTTTTATCCTC 

TTCCTCATCGACGTGGCCTGCAGGGCAAAGGAGTGCTCC^ 

GTCCTCAACTCGGCCATGAACCCTGTCATCTACACGCTGGCCAGCAAAGAGATGAGGC GC GCCTTCTTCCGGTTG 

GTGTGCGGCTGTCTAGTCAAGGGCAAGGGGACCCAGGCCTCACCCATGCAGCCTGCCCTCGACCCAAGCAGAAGT 

AAGTCAAGCTCCAGTAACAACAGCAGCCACTCTCCGAAGGTCAAGGAAGACCTGCCCCGCGTGGCTA^ 

TGCATCATTGACAAAAACAGGTCGTTTCAGAACGGGGTCCTCTGCAAGTGACGGTCTCAACAGGGCAAGCTCTGC 

AGCCACACCTATTTATTGGATGGATCACTTCCACGTGGGGCCTTAAGAGATC 

GTGGCCAGACTGTGTGGTATCTCCTGAGGAGGAGACCCAAGGAGTC^ 

TGACTTATCTGTTGCGGGCTCCATCCTTCTGAATGTACCAAGCTGCTGAC 

TACCCCTGGTCCAAGGAAAGCAAGGTGGTGAAGAGCTATGCACATC 

TCACAC TAC ATTTATTTTCATAAAA (SEQ IDNO:33) 



MATTHAQGHQPVLGNDTLREHYDWGKLAGRLRD^ 

YFFIGNLALCDLLAGIAYKVNILM 

KKHRWLLIGMCWLIAFSLGALPILGVmCL 

SSRRVANHNSERSMALLRTVVI VVSVFIACWSPLF ILFL IDVACRAKECS ILFKSQWFIMLAVIiNSAMNrPVI YTL 
ASKEMRRAFFRLVCGCLVKGKGTQAS PMQPALDPSRSKSS S SNNS SHS PKVKEDLPRVATS SC I IDKNRSFQNGV 
LCK (SEQIDNO:34) 



FIGURE 17 



WO 02/03793 
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PCT/US01/21923 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



GATTCTTTAGCATGTTGGCTGCTGGAGAAAAGTTCAGGGGCAGGCGACAAGGTCCGGGTG 

CTGAGCAAGGTGGCTGGACAAGCCTAGCGGGAGAGAAACCTGAGGCCACGAGCGACCTCC 

GGAGGAGCCCCTAGACGGGAGTCTTAGCTGAGACACGGCAGCATGCAACCAGCCCAGATG 

CGCCTTGCAGAACGAGAGCCTATTTTCAACACTCTTCCCGCAGTCGGGCCTGCACGCCTT 

CAGGAACCCACCACCACTCTCCCAATTGTTCCCTGAAATGAACGTTCCTGGCGCGCCCTC 

TCGTGGACTGTTGAAGTAACCATCTGTCTCTAGGAAGCCATGGCAACCACGCATGCGCAG 

GGCCACCAGCCAGTCTTGGGAAATGACACTCTCCGGGAACATTACGATTACGTGGGGAAA 

CTGGCAGGCAGGCTGCGGGACCCCCCTGAGGGTGGCACCCTCATCACCACCATCCTCTTC 

TTGGTC AC CTGTAGCTTC ATC GTC TTGGAGAACCTGATGGTTTTGATTGCCATCTGGAAA 

AACAATAAATTTCACAACCGCATGTACTTTTTCATCGGCAACTTGGCTCTCTGCGACCTG 

CTGGCCGGCATAGCATACAAGGTCAATATTCTTATGTCCGGCAGGAAGACGTTCAGTCTG 

TCTCCAACAGTGTGGTTCCTCAGGGAGGGCAGTATGTTCGTAGCCCTGGGCGCATCCACC 

TGCAGCTTACTGGCCATTGCCATTGAGCGACACCTGACCATGATCAAGATGAGGCCATAT 

GATGCCAACAAGAAGCACCGCGTGTTCCTTCTGATTGGGATGTGCTGGCTAATTGCCTTC 

TCGCTGGGTGCCCTGCCAATCCTGGGCTGGAACTGCCTGGAGAACT 

ACCATCTTGCCCCTCTACTCCAAGAAATACATCGCCTTCCTCATCAGTATCTTCACCGCC 

ATTCTGGTGACCATCGTCATCTTGTATGCGCGCATCTACTGCCTGGTCAAGTCCAGCAGC 

CGC AGGGTGGC C AAC CAC AACTC TGAGAGATC C ATGGCCC TTCTGC GGACC GTAGTG ATT 

GTGGTGAGTGTGTTCATTGCCTGTTGGTCCCCACTTTTTATCCTCTO 

GCCTGCAGGGCAAAGGAGTGCTCCATCCTCTTCAA GAGTCAGTGGTO^ 

GTCCTGAACTCGGCCATGAACCCTGTCATCTACACGCTGGCC^ 

GCCTTCTTCTCGGTTGGTGTGCGGC^TCTAGTC^GGGCAAG 

ATGCAGCCTGCCCTCGACCCAAGCAGAAGTAAGTCAAGCTCCAGTAA 

TCTCCGAAGGTCAAGGAAGACCTGCCCCGCGTGGCTACTTCTTCCTGC^ 

AACAGGTCGTTTGAGAACGGGGTCCTCTGCAAGTGACGGTCT 

AGCGACACCTATTCATTGCATGGATGACTTCGACGTC TTAAC TC T 

GGAGGTCTGCTTGCTGTGGCCAGACTGTGTGGTATCTCCTGAGGAGGAGACCCAAGGAGT 
CACCAACGCATTTCAGCCTGGACTTGGATGTGACTTATCTGTTGCGGGCTCCATCCTTCT 
GAATGTACCAAGCTGCTGACACGGTGCACTGCCTTCGGATGCCGCTACCCCTGGTCCAAG 
GAAAGCAAGGTGGTGAAGAGCTATGCACATGCACATTTGGGATGATTGTCTCCCTTGCGT 
TCACACTACATTTATTTTCATAAAA 



FIGURE 18A 



WO 02/03793 
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PCT/US01/21923 



Gene Sequence Structure * 

Size of fulMength 
cDNA: 1825 bp 



1016 bp 



Sequence Deleted 



1174 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 1763 



Arm Length: 
5": 3.3 kb 
3': 2.5 kb 



Neo 



r iHiwiiiiiWiiii : J vmm 





l-ife"*^'— — - Endogenous Locus''. 

* Not drawn to scale 



5 ■ >TGGGATGTGCTGGCTAATTGC 

CTTCTCGCTGGGTGCCCTGCCAAT 

CCTGGGCTGGAACTGCCTGGAGAA 

CTTTCCCGACTGCTCTACCATCTT 

GCCCCTCTACTCCAAGAAATACAT 

CGCCTTCCTCATCAGTATCTTCAC 

CGCCATTCTGGTGACCATCGTCAT 

CTTGTATGC GCGCATCTACTGCCT 

GGTCAAGTCCA<3 1 

(SEQIDNO:35) 



5 ■ >AGAGTCAGTGGTTCATCATGC 

TGGCTGTCCTCAACTCGGCCATGA 

ACCCTGTCATCTACACGCTGGCCA 

GCAAAGAGATGAGGCGCGCCTTCT 

TCCGGTTGGTGTGCGGCTGTCTAG 

TCAAGGGCAAGGGGACCCAGGCCT 

CACCCATGCAGCCTGCCCTCGACC 

CAAGCAGAAGTAAGTCAAGCTCCA 

GTAAC AAC AGC < 3 ' 

(SEQmNO:36) 



FIGURE 18B 

SUBSTITUTE SHEET (RULE 26) 



WO 02/03793 



28/33 



PCT/US01/21923 



CTGGAACAGCGGCGGCAGCCCTTGGAGGGGACCAGCGCAGCGGGGAGGTGAGAGGTGCTTCCTAGAGGGGAGCGG 
ACACCAAGAGAGGTCGCCAAGTGAGGCCTGGGGACGGGAGCGCCGGTCGAGTCAGCGCCAAACAGGGAGAGCCCA 
TTACGTGACTGTCCCGAGGGCTCGGGCTCCCAGCCCTTGTACTGCGCAAAACTGGGCGGTGAGAGCCGGCTGCCG 
GTGCGCTGATGAACTTTGTTTCCTCTAGAGCGTGATCTCGGGCAATGGCAGCAGACCTGGAACCCTGGAATAGCA 
CCATCAATGGCACCTGGGAGGGGGACGAACTGGGATACAAGTGTCGCTTCAACGAGGACTTCAAGTACGTGCTGT 
TGCCCGTGTCCTATGGCGTGGTGTGCGTGCTCGGGTTGTGCCTGAACGTCGTGGCTCTCTATATCTTCCTATGCC 
GCCTCAAAACCTGGAACGCCTCCACCACCTACATGT 

TGCCGCTGTTGGTTTATTACTACGCCCGGGGTGACCACTGGCCATTTAGCACGGTGCTCTGCAAGCTGGTGCGTT 
TCCTCTTCTACACCAACCTCTACTGCAGCATCCTCTTCCTCACCTGCATCAGCGTGCACCGGTGCCTGGGAGTCC 
TGCGCCCTCTGCACTCCCTGCGTTGGGGCCGGGCCCGTTATGCCCGCCGGGTGGCTGCGGTTGTGTGGGTGCTGG 
TGCTGGCCTGCCAGGCACCCGTGCTCTACTTCGTCACCACCAGCGTGCGGGGAACCCGGATCACTTGCCATGACA 
CCTCGGCCCGAGAGCTCTTTAGCCATTTTGTGGCTT^ 

TTTCCGTAATCCTGGTCTGTTACGTGCTTATGGCCAGGCGGCTGCTCAAACCGGCTTA 

TGCCTCGGGCCAAGCGCAAGTCTGTGCGCACCATTGCCTTGGTACTGGCCGTCTTCGCCCTCTGCTTTCT 

TCCACGTCACGCGCACCCTCTACTACTCCTTCCGATCACTC 

TGGCATATAAGATCACCCGGCCGCTGGCCAGCGCCAACAGTTGTCTTGACCCGGTACTCTACTTCCTGGCAGGGC 
AGAGAC TTGTCC GCTTTGCCCGAGATGCC AAGCCACCCACGGAGC CTACCCCCAGCCCAC AGGCTCGTCGCAAGC 
TGGGCCTGCACAGGCCTAACAGAACTGTGAGGAAAGATTTGTCAGTC 

CCACACCAGCTGGAAGTGAGACTAAGGACATTCGGCTATAGCACCTCAGTCAGGTGTCAGTTTATGTAGGGG 

TGTAGGGGACCAAGGCTTGTTAAAGGGGCTGATATCCTCTGATGCGGATCAACAGCTCTCACTTGGCAC^GCTC 

AGGATACTCATCTTGCTGTCCCAAGAGTAGCAGGACTCCAAAGTCCC^^ 

CATATCTTAAGAAAGGCAGTGCACCCGACTCCTATGCAGACAGCTGCCATTCCTGCCAAGGTACCTAGGCTGAGA 

TTC AGC C TCC C AGATC C AAGGGGGAAGC AGGCTC AGAGAGGAC AC AGC AGAGCCCAGGC TACC TGGATTAGGAGT 

AGAATGK^GAAAGGTGACTCCACCCCTCTTGTTCCCATGGTGATGTAGGATGGACTGGAC^ 

TAGCTCAGAGGAGGACCCTCAGGCGAAGAGACAAACCTCTGAGAGCTAATGTCAGGTCCCTGGAGACTTCC 

GCTGGCAGCAGGTGGGAACCATAACTTATATGAAAGNCTGTGCACCCATTGGGCTGTGCTCTGGGGT^ 

GGAAGCTGTCACCACCCACACAAAGGNNTGTTATC 

GGGTGTGTGTGTGTGTGTGTGTGGGGGGG^^ 

TCTCTTTATTCTTCCAAGCATGAGAGGAGCTAAATATG (SEQ ID NO:37) 



MAADLEPV^STINGTWEGDELGYKCRFNEDFKYVLLPVSYGWCVLGL^ 

AVSDSLYAASLPLLVYYYARGDHWPF STVLCKLVRFL FYTNLYC S ILPLTC I SVHRCLGVLRPLHSLRWGRARYA 
RRVAAWW/LVIxACQAPVLYFVTT 

LKPAYGTTGGLPRAKRKSVRTIALVLAVFAI^FLPFH^ 

LDPVLYFLAGQRLVRFARDAKP PTEPTPS PQARRKLGLHRPNRTVRKDLSVS SDDS RRTESTPAGSETKDI RL 
(SEQIDNO:38> 



FIGURE 19 



WO 02/03793 
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Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



CTGGAACAGCGGCGGCAGCCCTTGGAGGGGACCAGCGCAGCGGGGAGGTGAGAGGTGCTT 
C CTAGAGGGGAGC GGAC ACC AAGAGAGGTC GC C AAGTGAGGC C TGGGGAC GGGAGC GC C G 
GTCGAGTCAGCGCCAAACAGGGAGAGCCCATTACGTGACTGTCCCGAGGGCTCGGGCTCC 
CAGCCCTTGTACTGCGCAAAACTGGGCGGTGAGAGCCGGCTGCCGGTGCGCTGATGAACT 

CCATCAATGGCACCTGGGAGGGGGACGAACTGGGATACAAGTGTCGCTTC AACGAGGACT 

TCAAGTACGTGCTGTTGCCCGTGTCCTATGGCGTGGTGTGCGTGCTCGGGTTGTGCCTGA 

ACGTCGTGGCTCTCTATATCTTCCTATGCCGCCTCAAAACCTGGAACGCCTCCACCACCT 

ACATGTTTCACCTGGCAGTTTCGGACTCTCTCTACGCAGCGTCCCTGCCGCTGTO 

ATTACTACGCCCGGGGTGACCACTGGCCATTTAGCACGGTGCTCTGCAAGCTGGTGCGTT 

TCCTCTTCTACACCAACCTCTACTGCAGCATCCTCTT CCTCACOTGCATCAGCGTGCAgg 

GGTGCCTGGGAGTCCTGCGCCCTCTGCACTCCCTGCGTTGGGGCCGGGCCCGTTATGCCC 

GCCGGGTGGCTGCGGTTGTGTGGGTGCTGGTGCTGGCCTGCCAGGCACCCGTGCTCTACT 

TCGTCACCACCAGCGTGCGGGGAACCCGGATCACTTGCCATGACACCTCGGCCCGAGAGC 

TCTTTAGCGATTTTGTGGCTTACAGCTCCGTG 

TTTCCGTAATCCTGGTCTGTTACGTGCTTATGGCCAGGCGGCTGCTCAAACCGGCTTATG 

GGACCACAGGAGGTCTGCCTCGGGCCAAGCGCAAGTCTGTGCGCACCATTGCCTTGGTAC 

TGGCCGTCTTCGCCCTCTGCTTTCTGCCTTTCCACGTCACGCGCACCCTCTAC 

TCCGATCACTTGACCTCAGCTGCCACACCCTCAACGCCATCAACATGGCATATAAGATCA 

CCCGGCCGCTGGCCAGCGCCAACAGTTGTCTTGACCCGGTACTCTACTTCCTGGCAGGGC 

AGAGACTTGTCCGCTTTGCCCGAGATGCCAAGCCACCCACGGAGCCTACCCCCAGCCCAC 

AGGCTCGTCGCAAGCTGGGCCTGCACAGGCCTAACAGAACTGTGAGGAAAGATTTGTCAG 

TC AGC AGTGACGACTC AAGAC GGAC AGAGTC C AC AC C AGCTGG AAGTGAGAC TAAGGACA 

TTCGGCTATAGCACCTCAGTCAGGTGTCAGTTTATGTAGGGGAGCTGTAGGGGACCAAGG 

CTTGTTAAAGGGGCTGATATCCTCTGATGCGGATCAACAGCTCTCACTTGGCAGGGGCTC 

AGGATACTCATCTTGCTGTCCCAAGAGTAGCAGGACTCCAAAGTCCCAATCATTTGTACG 

TGTGAGTTGGGAAACCATATCTTAAGAAAGGCAGTGCACCCGACTCCTATGCAGACAGCT 

GCCATTCCTGCCAAGGTACCTAGGCTGAGATTCAGCCTCCCAGATCCAAGGGGGAAGCAG 

GCTCAGAGAGGACACAGCAGAGCCCAGGCTACCTGGATTAGGAGTAGAATGGGAAAGGTG 

ACTCCACCCCTCTTGTTCCCATGGTGATGTAGGATGGACTGGACCTGGGCACGATGGACT 

TAGCTC AGAGGAGGAC CCTCAGGCCAAGAGAC AAAC CTC TG AGAGCTAATGTC AGGTC C C 

TGGAGACTTCCCCTGGCTGGCAGCAGGTGGGAACCATAACTTATATGAAAGNCTGTGCAC 

CCATTGGGCTGTGCTCTGGGGTATGACCTGGGAAGCTGTCACCACCCACACAAAGGNNTG 

TTATCTCTGCCTCCTNCCTTCCTTCAGTAGTAGGAAGTTTTGGATGGG 

TGTGTGTGGGGGGGGTGATCCCTTCTCAATCCCCCAGGTCAACCCTCCTTTC 

TCTCTTTATTCTTCCAAGCATGAGAGGAGCTAAATATG 
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Gene Sequence Structure * 351 bp 

1 



Sequence Deleted 



636 bp 



Size of full-length 
cDNA: 2138 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 2085 



Arm Length: 
5': 5 kb 
3': 2 kb 



Neo 





* Not drawn to scale 



5 " > AAGCAAGC GATCTATCACCAA 

GCTCTACCTCCAGCCTTAGGCAAA 

TCTTAGCAAGTCCCCTCCCCCTCT 

GAGC CC AGAC ATGAC AGC TTTC TT 

ACCACACCTCCCTTTCCTGCAGGG 

CAATGGCAGCGGACCTGGAACCCT 

GGAATAGCACCATCAATGGCACCT 

GGGAGGGGGACGAACTGGGATACA 

AGTGTCGCTTC < 3 » 

(SEQIDNO:39) 



5 ' >TCCTCACCTGCATCAGCGTGC 

ACCGGTGCCTGGGAGTCCTGCGCC 

CTCTGCACTCCCTGCGTTGGGGCC 

GGGCCCGTTATGC CCGCC GGGTGG 

CTGCGGTTGTGTGGGTGCTGGTGC 

TGGCCTGCCAGGCGCCCGTGCTCT 

ACTTCGTCACCACCAGCGTGCGGG 

GAACCCGGATCACTTGCCATGACA 

CCTCGGCCCGAO ' 

(SEQIDNO:40) 
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CTGCGGGTCAAGCTGACAGTCAGATATAGAACGGTTACCACTGAAAG^ 

TGGCTAGTTTCCTTTCTGGTGGGGCTGACCCCCATGT^ 

AATAGCAGCACTCTTTTATGCCACTTCCGTTCCGT 

TGGATCCTCGTCCCCCTGGTTGTCATGTGTGTCATCTACCTAGACATCTTCTACATCATCCGAAATAAGCTCAGT 
CAAAACCTGTCTGGCTTCAGAGAGACGCGTGCATTTTATGGACGGGAGTTCAAGACAGCTAAGTCCCTGTTTCTG 
GTTCTCTTCTTG (SEQIDNO:41) 



LRVKLTVRYRTVTTQRRI WLFLGLC WLVS FLVGLTPMFGWNRKATLAS SQNS STLLCHFRSWSLDYMVFF SFVT 
WILVPLVVMCVIYLDIFYIIRNKTiSQNLSGFRETRAFYGREFKTAKSLFLVLFL (SEQ ID NO:42) 
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Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



CTGCGGGTCAAGCTGACAGTCAGATATAGAACGGTTACCACTCAAAGAAGAATATGGCTA 
TTCTTGGGCCTTTGCTGGCTAGTTTCCTTTC 

AATAGAA AAGCAACCTTAGCGAGCTCTCAAAATAGCAGCACTCTTTTATG CCACOTCCGT 

TCCGTGGTCAGTTTGGATTACATGGTCTTCTTCAGCTTCGTCACCTGGATCCTCGTCCCC 

CTGGTTGTCATGTGTGTCATCTACCTAGACATCTTCTACATCATCCGAAATAAGCTCAGT 

CAAAACCTGTCTGGCTTCAGAGAGACGCGTGCATTTTATGGACGGGAGTT 

AAGTCCCTGTTTCTGGTTCTCTTCTTG 
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Gene Sequence Structure 5 

Size of partial 
cDNA: 387 bp 



81 bp 



Sequence Deleted 



169 bp 



ii.i.- J r:ti- .-.i * .-crura ^Ji:ai»R!siiUiiJia3u 




Targeting Vector* 
(genomic sequence) 

Construct Number: 2827 



Arm Length: 
5':1.6kb 
3': 5 kb 



LacZ-Neo 





* Not drawn to scale 



5 1 > GGATTTAGTC AGAGTGAC ATT 

AGGCCATGGCTATTTCACCTCAGC 

AGATTAGGGGATTGGGAGAGCTGA 

CTTTGTCTTTCTTGTTTTCTCCCA 

GGGAGTCATTTCATTGTTGTGAAC 

TCACCATTTTTCCCCCATTCAAAC 

AGATATAGAACGGTTACCACTCAA 

AGAAGAATATGGCTATTCTTGGGC 

CTTTGCTGGCT< 3 ■ 

(SEQIDNO:43) 



5 ' >GCCACTTCCGTTCCGTGGTCA 

GTTTGGATTACATGGTCTTCTTCA 

GCTTCGTCACCTGGATCCTCGTCC 

CCCTGGTTGTCATGTGTGTCATCT 

ACCTAGACATCTTCTACATCATCC 

GAAATAAGC TC AGTC AAAAC CTGT 

CTGGCTTCAGAGAGACGCGTGCAT 

TTTATGGAC GGGAGTTCAAGACAG 

CTAAGTCCCTG<3» 

(SEQIDNO:44) 
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Continuation of Box 1.2 
Claims Nos.: 16, 32, 51 and 52 



Present claims 15, 32, 51 and 52 each relate to a compound defined by 
reference to a desirable characteristic or property, namely that it 
ameliorates a phenotype associated with a disruption in a nuclear hormone 
receptor gene, that it modulates nuclear hormone receptor expression, 
that it modulates a behavior associated with a disruption in a nuclear 
hormone recptor gene, or that it modulates nuclear hormone receptor gene 
function. The claims cover all compounds having this characteristic or 
property, whereas the application provides support within the meaning of 
Article 5 PCT and/or disclosure within the meaning of Article 5 PCT for 
no such compound. In the present case, the claims so lack support, and 
the application so lacks disclosure, that a meaningful search over the 
whole of the claimed scope is impossible. Independent of the above 
reasoning, the claims also lack clarity (Article 5 PCT).. An attempt is 
made to define a compound by reference to a result to be achieved. Again, 
this lack of clarity in the present case is such as to render a 
meaningful search over the whole of the claimed scope impossible. 
Consequently, no search can be carried out for such speculative claims, 
the wording of which is a mere recitation of the results to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 17-52 (all totally) and 1-16 (all partially) 

A targeting construct comprising a first polynucleotide 
sequence homologous to a target gene, a second 
polynucleotide sequence homologous to the target gene, and a 
selectable marker; a method of producing said targeting 
construct; a cell comprising a disrupted said target gene; a 
non-human transgenic animal comprising a disruption in said 
target gene; a cell derived from said transgenic animal; a 
method of producing a transgenic mouse comprising a 
disruption in said target gene; a method of identifying an 
agent that modulates the expression of said target gene 
comprising the use of said transgenic animal or of said 
cell; a method of identifying an agent that modulates the 
function of said target gene comprising the use of said 
transgenic animal or of said cell; Said subject-matter 
wherein said target gene is a melanocortin-3 receptor gene. 



2. Claims: 53-78 (all totally) and 1-16 (all partially) 

As for subject 1, but wherein said target gene is a 5HT-2B 
gene. 



3. Claims: 79-108 (all totally) and 1-16 (all partially) 

As for subject 1, but wherein said target gene is a 
chemokine receptor 9A gene. 



4. Claims: 109-143 (all totally) and 1-16 (all partially) 

As for subject 1, but wherein said target gene is a 
glucocorticoid-induced receptor gene. 



5. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is an orphan 
GPR10 gene. 



6. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is an orphan 
GPR14 gene. 



7. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is an orphan 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 

GPR15 gene. 

8. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is a beta 
chemokine receptor (E01) gene. 

9. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is an 
endothelial differentiation 6PCR3 (EDG3) gene. 

10. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is an ATP 
receptor P2U1 gene. 

11. Claims: 1-16 (all partially) 

As for subject 1, but wherein said target gene is a 
adenosine 3 receptor gene. 
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